The explanation of the structure of «elementary» particles, and therefore the
prediction of their characteristics (mass, spin, coupling constants with the various
fields), has been the object of much research proceeding mainly from two concepts.

— The concept which we call «realistic», which is an attempt to explain
the structure of elementary particles, considered as formed in a definite manner


At present with the Theoretical Study Division of CERN (European Organization
for Nuclear Research) at the Institute for Theoretical Physics, University of


942 B. JOUVET 


by certain spin ½ particles, supposed more elementary. The main difficulty
lies in the way to state, and then to resolve, the problem of the binding of
fermions without being given the medium of the boson fields a priori (1).


— At the beginning of research in this field, this first concept did not
appear to be satisfactory, and more formal methods proceeding from a
phenomenological concept of elementary particles were able, if not to explain
the relations between the particles, at least to give the form of the
equations of the particles supposed free and without structure (2). Attempts
have been made for several years to propose new frameworks and new
principles to represent or deduce either the masses and spins or the couplings
of the elementary particles (non-local theories, spin or charge space, mass
operators...) (3).


It is the purpose of this paper to present a new form of the «realistic»
concept of mesons, unlike the previously proposed models (1). We limit our
examples to the pseudoscalar interactions. We shall show that, in order
to unify the Yukawa theory of nuclear forces and the Fermi theory of beta decay
(Sect. 2), the choice of a Fermi coupling between the various pairs of fermions
(nucleons and leptons) allows us, when using the regularization and a certain
summing up of the terms in the S matrix (Sect. 3), to demonstrate the
existence of mesons (bosons) whose masses and coupling constants with the various
fermion pairs are expressed in terms of the properties of the fermions only (Sect. 4).
The construction of a theory of Yukawa type equivalent to the realistic theory
allows us to give the rules for the treatment of every term of the S matrix.
For certain values of the regularization constants, the Yukawa theory is
renormalizable (Sect. 5). Various problems concerning the elementary particles
are suggested (Sect. 6). This work is the generalization of a more
restricted attempt to explain the photon by postulating a Fermi vector
coupling between the electron and neutrino pairs (an attempt to which we refer
here as «V°») (4).


(1) L. DE BROGLIE: Théorie des particules à spin (méthode de fusion) (Paris, 1943);
C. N. YANG and E. FERMI: Phys. Rev., 76, 1739 (1949); G. WENTZEL: Phys. Rev., 79,
710 (1950); W. HEISENBERG: Zeits. f. Naturf., 5a, 251, 367 (1950); Physica, 19, 897
(1953); H. YUKAWA: p. 287 in L. de Broglie, physicien et penseur (Paris, 1952).

(2) A. PROCA: Journ. de Phys., 7, 347 (1936); M. FIERZ: Helv. Phys. Acta, 12, 44
(1939); W. PAULI and M. FIERZ: Proc. Roy. Soc., 173, 211 (1939).

(3) M. BORN: Rev. Mod. Phys., 21, 463 (1949); B. T. DARLING: Phys. Rev., 80,
460 (1950); H. YUKAWA: Phys. Rev., 91, 415, 416 (1953); S. WATANABE: Phys. Rev.,
91, 771 (1953); PAIS: Physica, 19, 869 (1953); A. PROCA: Journ. de Phys., 15, (1954);
O. HARA and T. MARUMORI: Nuovo Cimento, 12, 309 (1954).

(4) B. JOUVET: Journ. de Math., 38, 201 (1954) or Thèse de doctorat, Paris (1954).
A summary of V° and of the present paper is published in Compt. Rend. Ac. Sci., 237,
1642 (1953); 238, 55, 327, 454 (1954); 239, 1121, 1267 (1954).
2. — The Theories of Nuclear Interactions.


Every theory of the nuclear interactions tries to explain, with the smallest 
number of concepts, the five interactions (at least): 


1) Interactions nucleon-nucleon: (nuclear force).
2) Interactions nucleon-leptons (e, ν, μ): (β-decays).
3) Interactions nucleons-π mesons: (diffusion, absorption or emission).
4) Interactions leptons-π mesons: (π meson decay).
5) Interactions leptons-leptons: (μ meson decay).


In quantum theory this problem reduces to the choice of what particles 
and what couplings between these particles can be considered as existing fun- 
damentally. 

We review briefly various solutions. 


a) The theories F (theory of Fermi, or « pair theories ») (5). — The exist- 
ence of fermions and couplings between the pairs of fermions is enough to 
provide an explanation of the interactions 1, 2, 5, but this theory does not 
consider the interactions 3 and 4. 


b) The theories Y (theories of Yukawa, or «meson theories»). — The first
research on elementary particles led YUKAWA to invent the meson and the meson
coupling with nucleons, foreseeing interaction 3 and explaining 1. For some
years, the lack of experimental precision and the confusion of π-mesons with
μ-mesons permitted the hope of explaining the interactions 2 and 5 by
meson-fermion coupling, giving rise also to the interactions 4 and 5. This
hope no longer exists.


c) The theory Y+F. — Since the theories a) and b) are valid for some
interactions and not for others, we are led to postulate that there exist
fundamental mesons and fermions coupled with YUKAWA coupling. This postulate
explains interactions 3, 4 and 1. Then, to explain the Fermi interactions 2
and 5, we also postulate the existence of a fundamental coupling between the
fermion pairs. Thus, the interactions are the result of two types of coupling, F
or Y, with F larger than Y for the interactions 2 and 5, and Y larger than F
for interaction 1. However, to be consistent, it would be necessary to show
that the Fermi interactions result effectively from the existence of Fermi
coupling. But one does not know how to do this because Fermi coupling is
not renormalizable in the sense of Dyson. Then, for lack of a theoretical proof


(5) See for bibliography: M. TANNENWALD: Phys. Rev., 86, 332 (1952). 
that the corrections to Fermi coupling due to the closed loops of fermions
containing unknown constants are negligible, one uses the experimental fact
that the Fermi interactions appear to be almost independent of the energy.
However, this empirical argument seems to us not sufficiently convincing to
justify this two-headed theory.

Then a convenient treatment of the Fermi coupling has led us to the
possibility of constructing a new type of theory.


d) The realistic theory of elementary particles (theory R). — We postulate
that there exist only fermions (nucleons and leptons), and Fermi coupling
between the various pairs of fermions. According to the realistic concept of
elementary particles, one must be able to deduce from this hypothesis alone
not only the form of the interactions 2 and 5 but also the existence of boson
particles and the form of their coupling with fermions, and then the
interactions 3 and 4, and therefore 1. To show that this program is realisable
is the purpose of this paper. It will appear that the theory R is equivalent
to a more general Yukawa theory than the usual one.


e) The generalized Yukawa theories. — One can avoid the difficulties
concerning the form and the value of the Fermi interaction deduced from the
usual Y-theory by postulating the existence of heavy boson mesons coupled
more strongly with leptons and more weakly with nucleons than the mesons
now known. Such a hypothesis, and the many others which are necessary
to explain the high energy nuclear features (hyperons...), often appear arbitrary,
and thus it is preferable to have a theory of particles. However, the price we
must pay in the present R-theory is the rejection of the Dyson renormalization method
and the introduction in the theory of a finite number of arbitrary constants
in terms of which the masses and coupling constants of the boson particles are
expressed.


3. — General Equations and Methods. 


a) Particles and coupling. — The only particles that we suppose to exist
fundamentally are the fermions, forming two families, nucleons and leptons.
We suppose they have spin ½. We suppose that the only neutral lepton is the
neutrino, and denote the leptons by $L_0, L_1, L_2, \dots, L_l$ ($L_0$ = neutrino).

Let n and p denote the neutron and proton, and $N_i$ the particles of the
nucleon family (n, p, hyperons).

In this paper we are interested only in the charged pairs of fermions. All
the coupling constants in the charged Fermi coupling are supposed to have
the same value, g, of unknown numerical value.

Two pairs which are identical or charge conjugate are supposed uncoupled.
The coupled pairs are chosen so that the charge is conserved in elementary
processes, and the Lagrangian is Hermitian.
In precise examples, we will utilise the interaction Lagrangian:
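(3.a1)  $\mathcal{L} = g \sum_{i} \{ (\bar\psi_p \gamma_5 \psi_n)(\bar\psi_{L_i} \gamma_5 \psi_{L_0}) + (\bar\psi_n \gamma_5 \psi_p)(\bar\psi_{L_0} \gamma_5 \psi_{L_i}) \}$
        $\quad + \, g \sum_{i<j} \{ (\bar\psi_{L_i} \gamma_5 \psi_{L_0})(\bar\psi_{L_0} \gamma_5 \psi_{L_j}) + (\bar\psi_{L_j} \gamma_5 \psi_{L_0})(\bar\psi_{L_0} \gamma_5 \psi_{L_i}) \}$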


(Charged pseudoscalar couplings, omission of hyperons). 


b) Regularization, normalization and renormalization. — As is well known,
the quantum theory of fields is not usable if one does not add certain principles
to allow a non-ambiguous treatment of the infinite expressions. These
principles, which often appear as rules of calculation because their meaning is
unknown, are the principles of renormalization or, otherwise, of regularization,
which is associated with a «normalization» for theories which are not
renormalizable. This last case occurs in the present paper.


We call «regularization» the method which allows us, first, to separate
in a covariant way the infinities in an ultraviolet divergent expression and,
second, to substitute for these infinities some arbitrary constants. This is a
mathematical operation, always possible, which associates to every Feynman
graph an explicit expression containing a finite number of arbitrary constants.
One can imagine that this method supplies a deeper understanding of the
infinities problem. The ways to perform the regularization, though generally
equivalent, are not alike in the concepts that they represent (6).

After regularization we must interpret the physical meaning of the resulting
expressions, by studying the effect of the arbitrary constants in various
concrete physical situations. Two cases can occur:


— The first possibility is that the theory is «Complete», which means that
all the arbitrary constants appearing in the theory can be interpreted as
corrections to the fundamental constants of the theory, generally contained
in the Lagrangian and in the wave normalizations. The existence of
invariances of the theory under certain transformations generally permits us to
eliminate or to give definite values to some regularization constants (gauge
invariances of Maxwell-Dirac theory, or Fermi V° coupling (4)).


— The other possibility is that the theory is «Incomplete» so that the 
regularization constants represent not only corrections to the constants of the 


(6) W. PAULI and VILLARS: Rev. Mod. Phys., 13, 203 (1941); W. PAULI:
Feldquantisierung, E.T.H. lectures (Zürich, 1950-1951); L. DE BROGLIE: Compt. Rend. Ac.
Sci., 229, 401 (1949); W. GÜTTINGER: Phys. Rev., 89, 1012 (1953).
theory but also introduce new types of physical concepts not foreseen in the
foundations of the theory (Lagrangian); these concepts are new fields or
particles and new forms of coupling between the fields (example, the arbitrary
meson-meson coupling appearing in the PS theory of mesons).

Once this interpretation has been made, an infinite sum of terms depending
on an infinite number of arbitrary constants appears in the calculation of a
given effect up to an infinite power of the coupling constants. Thus the
fundamental constants of the theory are corrected by an infinite number
of arbitrary corrections. For this reason, these corrections are not usable.
In order to avoid this difficulty the «normalization» principle is defined (7).


— Principle of normalization for complete theories. The arbitrary constants
are chosen so that the corrections they give to the fundamental constants
of the theory (contained in the Lagrangian), for any order in the development
of the S matrix, are null. For example, in meson theory the meson mass and
the coupling constant of mesons with nucleons are normalized by postulating
that the nucleon closed loops carried by the meson, $B(p^2)$, satisfy the
condition $[B(p^2)/(p^2+\kappa^2)]_{p^2=-\kappa^2} = 0$; therefore:

$B(-\kappa^2) = 0$.


This method is used in electrodynamics when one postulates the gauge
invariance of the Lagrangian, which is equivalent to postulating the nullity
of the photon mass in order to eliminate in the S matrix the corrections to the
photon mass.

When no more than three wave functions are coupled in the same coupling,
as in Maxwell-Dirac electromagnetism, the normalization procedure can be
expressed mathematically as the conservation of the poles and residues
appearing in every degree of approximation of the S matrix.


— Principle of normalization for incomplete theories. One method for the
treatment of the physical quantities not included in the Lagrangian consists
in completing this Lagrangian by introducing convenient terms containing
new constants to be determined by experiments (example, the introduction


(7) We do not at all think that the present arbitrary regularization and
normalization process constitutes a satisfactory solution of the infinities problem. But for
want of a renormalization method able to treat the fundamental problems of particle
structure (mass and coupling), the proposed method is equivalent to Dyson's when
it applies, and otherwise must give satisfactory results if the true, but now unknown,
series of corrections to the fundamental constants of the theory are strongly convergent.
See, on these problems, the LORENTZ-KAMERLINGH ONNES memorial papers in Physica,
19, (1953).
of $\lambda\varphi^4$ coupling in meson theory). Another way is to retain the first constant
of a new type appearing in the S matrix and to normalize the other constants
of that type so that the corrections they add to the first constant are null.
However, the true difficulty of the normalization of theories is the physical
definition of the constants. It consists in the choice of the most convenient
physical conditions under which a certain interaction has a given value. This
difficulty appears more important when there exists a coupling between more
than three wave functions, as in the meson-meson coupling of the Yukawa
theory and in Fermi couplings; these cases will be discussed in Section 6.


c) The primitive interaction kernels. — The «pair theories» (5) consider
only the lowest order interaction terms between two nucleons, via lepton
fields. The usual treatment of the Fermi theory takes into account only the
first order term in g for the nucleon-lepton interaction. On the contrary, let
us define the «primitive interaction kernels», (UV), represented in Feynman
graphs as the sum of all the «chains» constituted by any number of closed
loops having only two vertices, and connecting a fermion line at a vertex
where the pair is U to another (or the same) fermion line at another vertex
where the pair is V (8).

As examples, with the coupling postulated in (3.a), the charged pairs
can be:

$U, V = (np), (L_0, L_1), \dots, (L_0, L_l)$.


Then we have

(3.c1)  [Feynman graphs: the kernels (UU) and (UV), each drawn as the sum of chains of two-vertex closed loops W joining a vertex where the pair is U to a vertex where the pair is U or V.]

(8) S. KAMEFUCHI and H. UMEZAWA: Prog. Theor. Phys., 8, 579 and 9, 529 (1953),
have shown that such an iteration of loops with two vertices allows us to treat the
«second kind» interactions, including the Fermi interactions. However, our paper
differs from theirs in the method of renormalization, in the demonstration of the
possibility of treating these interactions completely, and mainly in the physical interpretation,
since these authors have not seen that the Fermi theory is, on the whole, equivalent
to the Yukawa theory.

This iteration method is a generalization of the damping theory of Heitler (Proc.
Camb. Phys. Soc., 37, 291 (1941)) and Wilson (Proc. Camb. Phys. Soc., 37, 301 (1941)).
The S matrix elements due to these interactions between free particles are


en) S(UV) — Ja PIP VIK, (p) + y° aS PETE 


we x 


with Ji(p)J'(— p) = ERA e Pe FD): 
-d(p,— Ps Pigi 


and an analogous formula for S(UU). 

Here, the $p_\alpha$ are the momenta of the four fermions denoted by the indices
α, β, γ, δ, paired as U = (α, β) and V = (γ, δ). The operator $O^i$ is one of the
five Dirac matrix tensors (S, V, T, A, P). The constant $\gamma = ig/2\pi$. The
functions $K_W(p)$, associated with the closed loop formed with the pair W = (λ, μ),
are
Ke 3 9) = | d*x exp [— ipz| K; (2), 
(3.63) | w(P) Ky lu (p) =| I [ pa | As 124 
VERA ae Ne Pl. sul), Tie) Sty gla) == via) O° Aaa | 
Then, one primitive interaction kernel in impulse space is
(3.c4)  $(UV(p)) = \gamma + \gamma^2 \sum_{W \neq U,V} K_W(p) + \gamma^3 \sum_{W \neq U} \sum_{X \neq W,V} K_W(p) K_X(p) + \dots$


In this paper it is not necessary to specify the tensor index i.

The algebraically simple form of the kernels results from the fact that the
whole impulse is carried in one point of the vertices. More complicated integral
equations appear when we take into account the radiative corrections between
different loops, that is to say, when we are concerned with closed loops having
more than two vertices. This case will be considered in Section 5.

It is simpler to utilise the equations of which the kernels are solutions rather
than the series. One shows easily, from the properties of the Feynman kernels,
or by direct substitution, that the primitive interaction kernels satisfy the
equations (in impulse space)


(3.c5)  $(UU) = \gamma \sum_{W \neq U} (UW) K_W$,

        $(UV) = \gamma + \gamma \sum_{W \neq V} (UW) K_W = (VU) = \gamma + \gamma \sum_{W \neq U} (VW) K_W$.
The solutions are

(3.c6)  $(UU) = \frac{1}{D} \Big( \gamma^2 \sum_{W \neq U} K_W + 2\gamma^3 \sum_{W<X;\ W,X \neq U} K_W K_X + \dots \Big)$,

        $(UV) = \frac{1}{D} \Big( \gamma + \gamma^2 \sum_{W \neq U,V} K_W + \gamma^3 \sum_{W<X;\ W,X \neq U,V} K_W K_X + \dots \Big)$,

        $D = 1 - \gamma^2 \sum_{W<X} K_W K_X - 2\gamma^3 \sum_{W<X<Y} K_W K_X K_Y - \dots$


The series appearing in these expressions have a finite number of terms if
the number of fermion pairs that we consider is finite.
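
The statement that the kernels solve the chain equations can be checked mechanically. The following is a minimal sketch, not part of the original derivation. It assumes that every two distinct pairs are coupled by the same constant γ and that identical pairs are uncoupled, it replaces each loop function $K_W$ by an arbitrary rational number, and it takes as the closed form $D = 1 - \gamma^2 e_2 - 2\gamma^3 e_3 - \dots$ with $e_k$ the elementary symmetric polynomials of the $K_W$. The helper names (`elementary`, `denominator`, `kernel`, `satisfies_chain_equations`) are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import prod


def elementary(values, k):
    """k-th elementary symmetric polynomial of `values` (equals 1 for k = 0)."""
    return sum(prod(c) for c in combinations(values, k))


def denominator(gamma, K):
    """Assumed closed form: D = 1 - gamma^2 e_2(K) - 2 gamma^3 e_3(K) - ..."""
    return 1 - sum((k - 1) * gamma**k * elementary(K, k)
                   for k in range(2, len(K) + 1))


def kernel(gamma, K, u, v):
    """Assumed closed form of the primitive kernel between pairs u and v."""
    others = [K[w] for w in range(len(K)) if w not in (u, v)]
    if u == v:
        # (UU) numerator: gamma^2 e_1 + 2 gamma^3 e_2 + ... over pairs W != U
        num = sum(k * gamma**(k + 1) * elementary(others, k)
                  for k in range(1, len(others) + 1))
    else:
        # (UV) numerator: gamma + gamma^2 e_1 + gamma^3 e_2 + ... over W != U, V
        num = sum(gamma**(k + 1) * elementary(others, k)
                  for k in range(len(others) + 1))
    return num / denominator(gamma, K)


def satisfies_chain_equations(gamma, K):
    """Check (UU) = g sum_{W!=U} (UW) K_W and (UV) = g + g sum_{W!=V} (UW) K_W."""
    n = len(K)
    for u in range(n):
        for v in range(n):
            direct = 0 if u == v else gamma
            chained = gamma * sum(kernel(gamma, K, u, w) * K[w]
                                  for w in range(n) if w != v)
            if kernel(gamma, K, u, v) != direct + chained:
                return False
    return True


# Arbitrary illustrative values (not taken from the paper), four fermion pairs.
gamma = Fraction(1, 3)
loops = [Fraction(2), Fraction(-5, 7), Fraction(4, 9), Fraction(11, 2)]
print(satisfies_chain_equations(gamma, loops))
```

Exact rational arithmetic is used so that the comparison is an identity test rather than a floating-point tolerance check; the sums terminate because the number of pairs is finite.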

In the solutions we have omitted arbitrary distributions which appear in
the factor 1/D (the expression should be $1/D + c\,\delta(D)$, where c is an arbitrary
constant). However, as we have shown in V°, this constant is determined by
the physical condition that the interactions must be causal, which is true
because the pair exchange is causal. Its effect is to shift the poles of the
kernel by an arbitrarily small imaginary quantity, as in the definition of the
$\Delta_F$ functions.


4. — Physical Meaning of the Interactions: the Mesons. 


a) Form of the interaction kernels. — After regularization, every closed
loop having two vertices is a function of the total momentum-energy of the
pair and of two arbitrary constants. Without any loss of generality one can
suppose that these two constants express some properties of the fermion pair
forming the loop. Then for the loop formed with the pair W = (λ, μ), one has

(4.a1)  $K_W(p) = K_W(p^2, C^1_W, C^2_W)$

(p = impulse, $C^1_W$ and $C^2_W$ the reg. const.).


For some interval of values $p_1^2 < p^2 < p_2^2$, the function $K_W(p^2)$ can be
approximated, with a better approximation when the $C^i_W$ are greater and the interval
smaller, by

(4.a2)  $K_W(p^2) \simeq F_W (p^2 + M_W)$.

The dimensionless constant $F_W$ and the constant $M_W$ (dimension M²) are
functions of the $C^i_W$ constants and generally depend weakly on the interval.


In this approximation (4.a2), the kernel (UU) is the ratio of two polynomials,
the degree of the denominator D being n, and of the numerator n − 1, where
n is the number of different pairs (w = 1, 2, ..., n). Then one can write

(4.a3)  $(UU) \simeq \sum_{l=1}^{n} \frac{A^l_{uu}}{p^2 + \kappa_l^2}$.

The numerators of the kernels (UV) with u ≠ v have degree n − 2, and the
denominators are the same. Thus

(4.a4)  $(UV) \simeq \sum_{l=1}^{n} \frac{A^l_{uv}}{p^2 + \kappa_l^2}$, with $\sum_{l=1}^{n} A^l_{uv} = 0$.

The last equality is not important, because it is due to the approximation (4.a2).
In a more general way, let us define, independently of the approximation (4.a2),

(4.a5)  $A^l_{uv} = [(UV(p^2))(p^2 + \kappa_l^2)]_{p^2 = -\kappa_l^2}$,

(4.a6)  $D(-\kappa_l^2) = 0$.

The constants $\kappa_l^2$ and $A^l_{uv}$ defined in this way are calculable functions of the
fermion masses and of the $C^i$ constants of all the pairs, and thus depend only on the
properties of the fermions.


b) Existence and properties of mesons. — We now limit the discussion of
the preceding equations by postulating that the constants $A^l_{uu}$ and $\kappa_l^2$ are
positive real quantities.

It is then clear that the poles of the kernels (zeros of D) can be interpreted
as mesons, on account of the identity (in the approximation (4.a2)) or of the
similarity (in the general case (4.a6)) of the kernels (UV) with the kernels given
by a Yukawa theory postulating the exchange of mesons of mass $\kappa_l$ between
the fermions.

But, and this point is essential, to be able to interpret the kernels (UV)
as resulting from the exchange of mesons, one must be able to define the coupling
constant $G^l_u$ between the $\pi_l$ meson and the fermion pair u through the
relations

(4.b1)  $A^l_{uv} = G^l_u G^l_v$.

Now this is possible only if there exist the identities

(4.b2)  $A^l_{uu} A^l_{vv} = (A^l_{uv})^2$

between the constants $A^l_{uv}$.
From (4.a5) and (4.a6), one sees that this identity is satisfied if
(4.b3)  $[D^2 \{(UU)(VV) - (UV)^2\}](p^2) = 0$ when $(p^2 + \kappa_l^2) = 0$.


We show in the Appendix that, as a result of the hypothesis on the
couplings (3.a) (namely that the coupling between identical pairs is null, and that
the coupling constant between different pairs is the same constant g), this
identity is satisfied. The reason is that the quantity in brackets in (4.b3) is
proportional to the function $D(p^2)$ which, by definition (4.a6), is null at every
pole, independently of the complex value of the constants $A^l_{uv}$ and $\kappa_l^2$. This
result is also independent of the tensor form of the coupling. It is possible
that there exists a more general choice of the relations between the Fermi
coupling constants which allows the possibility of interpreting the kernel with
meson concepts, but the essential point we want to underline here is that the
equality of the Fermi constants for different pairs of fermions (np-eν, np-μν,
eν-μν), which is verified experimentally, implies the existence of bosons through
which the Fermi interactions occur.
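
The factorization of the residues can be illustrated numerically on the smallest case. The following is a toy sketch under stated assumptions, not the Appendix proof: it takes only two fermion pairs u and v, uses the linear form $K_w(s) = F_w (s + M_w)$ with $s = p^2$, and assumes the two-pair kernels $(UU) = \gamma^2 K_v / D$, $(VV) = \gamma^2 K_u / D$, $(UV) = \gamma / D$ with $D = 1 - \gamma^2 K_u K_v$. The function name `two_pair_mesons` and all numerical values are hypothetical, and real numbers are used for γ purely for illustration.

```python
import cmath


def two_pair_mesons(gamma, F_u, M_u, F_v, M_v):
    """Poles and residues for two pairs with K_w(s) = F_w * (s + M_w), s = p^2.

    Returns a list of (kappa2, A_uu, A_vv, A_uv), one entry per pole
    s = -kappa2 of the assumed denominator D = 1 - gamma^2 K_u K_v.
    """
    c = gamma**2 * F_u * F_v
    # D(s) = a2 s^2 + a1 s + a0
    a2, a1, a0 = -c, -c * (M_u + M_v), 1 - c * M_u * M_v
    root = cmath.sqrt(a1 * a1 - 4 * a2 * a0)
    mesons = []
    for s in ((-a1 + root) / (2 * a2), (-a1 - root) / (2 * a2)):
        slope = 2 * a2 * s + a1          # dD/ds at the pole
        K_u, K_v = F_u * (s + M_u), F_v * (s + M_v)
        # Residue of N/D at a simple zero of D is N(s)/D'(s).
        mesons.append((-s, gamma**2 * K_v / slope,
                       gamma**2 * K_u / slope, gamma / slope))
    return mesons


# Illustrative values only (not from the paper).
for kappa2, A_uu, A_vv, A_uv in two_pair_mesons(1.0, -1.0, 1.0, -1.0, 4.0):
    G_u = cmath.sqrt(A_uu)
    G_v = A_uv / G_u                     # fixes the relative sign of G_v
    print(kappa2, abs(A_uu * A_vv - A_uv**2), abs(G_v**2 - A_vv))
```

In this two-pair case the identity holds because $\gamma^2 K_u K_v = 1$ at every zero of D, so that $A_{uu} A_{vv} = \gamma^2 / D'^2 = A_{uv}^2$; the sum of the two residues $A_{uv}$ also vanishes, as expected when the numerator of (UV) has degree n − 2 = 0.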


c) The meson equations. — The field operator of the meson must be some
combination of the field operators of the fermions, since the functional space is
determined by the eigenvectors of the free Hamiltonian, which is a function of
the fermion fields only. Let us write the S matrix for the scattering of two
fermions:


(4.c1) Sis > IS (UV (@ — 2") da da’ . 


In the approximation (4.a2) one has


‘sì ae FS 3 L, ql. 7 Ai ey rel 
(4.¢2) (OV Ga) ir BEG tee ee eee 
U 
where the $Y_l$ functions are the Yukawa potentials associated with the mesons $\pi_l$.


Then 


(4.03) N "| Sa, > Gia) de, 


u l 


with the definition of the meson field operator $\Pi_l(x)$
(4.c4) We eal =| VY (a= a \(> Cr D(a") da' : 
We have 


(4.¢5) EI). 
The exact form of the meson field operator, independently of the approximation
(4.a2), is given in Sect. 5.


d) The free mesons. — In the S matrix formalism we cannot represent
the free mesons (present at ± infinite time), and the mesons always appear
as intermediate states. Physically such a situation seems satisfactory since
we do not know stable mesons. However, as Prof. H. UMEZAWA has pointed
out to us, if the poles are real and positive quantities ($\kappa_l^2 > 0$), the addition
of an arbitrarily small imaginary quantity required to define the $\Delta_F$ functions
implies the existence of outgoing and ingoing free waves which are not null
at plus or minus infinity, which is inconsistent with the S matrix formalism.
The way to switch off the meson waves at infinity is to introduce a finite
imaginary part in the mass definition, giving rise to an exponential decrease
of the wave amplitude, associated with an increase of the free pairs of
fermions. Such a situation appears in this theory if, and only if, the masses $\kappa_l$
are larger than the energy necessary to allow a real creation of some fermion
pair; indeed the $K_W(p^2)$ functions present a finite imaginary part when
$-p^2 > (m_\lambda + m_\mu)^2$, and then the D function is a complex one (9). If further
information on this problem makes this interpretation correct, then to avoid
in V° theory (4) the presence of photons at infinity, it would be necessary to
have a finite imaginary part of the mass. According to this theory, this would
mean that the neutrino mass is not absolutely null. However, this has no effect
on the gauge invariance, which results only from the vectorial form of the
coupling and not from the nullity of the neutrino or photon mass.

Another possible way to attack the problem of particle structure is to
utilise some non-adiabatic method (TAMM-DANCOFF or BETHE-SALPETER). However,
if it is possible to obtain the masses of the mesons in this way, it is not
possible to deduce also the meson-fermion coupling constants. Furthermore, we
have difficulties, in this method, in interpreting the part of the mass
operator having complex value.

This difficulty seems connected with the fact that one cannot consider the
mesons as formed by fermions «bound» together in the usual sense of this
word, which implies the notion of persistence of every constituent. In our
concept, the meson is successively in every possible state of virtual pairs of
fermions, and it appears as some «fusion» state due to the fluctuation of the
pairs. The same difference exists in the prediction of the mass of «bound»
or «fused» particles. The mass of the particle resulting from the binding of
two particles is, for ordinary couplings, not very far from the sum of the
masses of the constituents. In fusion, on the contrary, one has no idea of

(9) The presence of complex poles in fermion self energy contributions, and the
mathematical difficulties thereby arising, have been indicated by G. FELDMAN: Proc. Roy.
Soc., A 223, 112 (1954).
the resulting mass, and the intuitive idea that the mass is some mean value
of the masses of pairs is wrong, as can be shown by the example of the
photon for which the mass is independent of the masses of electron, proton
and other charged pairs treated on the same footing as the neutrino pair.


e) The mesons in beta decay. — As a result of the preceding discussions
(Sect. 2, and 2.6), the beta decay necessarily occurs through the meson fields.
Precise experiments would be able to detect the mesonic influences. They
can be a modification of the lifetime due to interaction of the virtual mesons
with the mesons, the nucleons, or the Coulomb field of the nucleus, or a possible
bremsstrahlung of the mesons before the decay. We note that the presently known mesons
are not sufficient to explain the experimental result (2.5).


5. — The Realistic Meson Theory and the Yukawa Theories.


We now treat precise examples, using the Lagrangian (3.a1).

a) The Yukawa charged nuclear forces. — Let us suppose, as a first
approximation, that the regularization constants of the lepton loops are infinite

(5.a1)  $K_{L_i}$ = infinite

and that the only other pair is the charged nucleon pair (np). Then the
primitive interaction kernels are
(5.a2)  $(L_i L_j) = 0$,  $(N L_i) = 0$,  $(NN) = \dfrac{-\gamma l}{l - 1 + l \gamma K_N}$,
where we denote $L_i = (L_i, L_0)$, i = 1, 2, ..., l; and N = (np). In this
approximation (5.a1) there exists interaction between the nucleons and not between
the leptons nor between leptons and nucleons. We see that D generally has
one pole. Thus the theory may be equivalent to the usual Yukawa theory of
charged nuclear force.
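
The decoupling of the leptons in this limit can be illustrated with exact arithmetic. The sketch below is an assumption-laden check, not the author's calculation: it assumes equal coupling γ between distinct pairs and none between identical pairs, uses the resulting closed form of the chain sums (with $a_w = \gamma K_w$, $b_w = 1/(1+a_w)$, $s = 1 - \sum_w a_w b_w$), and compares the nucleon kernel with the limiting expression $-\gamma l / (l - 1 + l \gamma K_N)$ that this closed form gives when every lepton loop grows without bound. The function names and the numerical values are hypothetical.

```python
from fractions import Fraction


def primitive_kernels(gamma, K):
    """Matrix of kernels (UV) for equal coupling gamma between distinct pairs.

    Assumed closed form: (UV) = gamma b_u b_v / s for u != v and
    (UU) = gamma b_u (b_u - s) / s, with a_w = gamma*K_w, b_w = 1/(1 + a_w)
    and s = 1 - sum_w a_w b_w.
    """
    a = [gamma * k for k in K]
    b = [1 / (1 + x) for x in a]
    s = 1 - sum(x * y for x, y in zip(a, b))
    n = len(K)
    return [[gamma * b[u] * ((b[u] - s) if u == v else b[v]) / s
             for v in range(n)] for u in range(n)]


def nucleon_kernel_limit(gamma, K_N, n_leptons):
    """Limit of (NN) when every lepton loop K_L goes to infinity."""
    return -gamma * n_leptons / (n_leptons - 1 + n_leptons * gamma * K_N)


# Pair 0 is the nucleon pair N; pairs 1..l are the lepton pairs (illustrative values).
gamma, K_N, n_leptons = Fraction(1, 2), Fraction(3), 2
for K_L in (Fraction(10**3), Fraction(10**6), Fraction(10**12)):
    kernels = primitive_kernels(gamma, [K_N] + [K_L] * n_leptons)
    # (NN), (N L_1) and (L_1 L_2) for increasing lepton loop values
    print(float(kernels[0][0]), float(kernels[0][1]), float(kernels[1][2]))
print(float(nucleon_kernel_limit(gamma, K_N, n_leptons)))
```

As the lepton loop values increase, the printed (N L) and (L L) kernels shrink toward zero while (NN) approaches the limiting value, which is the behaviour described for the approximation (5.a1).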


α) One method of treating the interactions. — In a general way, we write
the $K_N$ function in the form

(5.a3)  $K_N(p^2) = a + b\,(p^2 + \kappa^2) + (p^2 + \kappa^2) R(p^2)$,  $R(-\kappa^2) = 0$,

the constants a and b being functions of the $C^1_N$ and $C^2_N$ regularization constants
of the nucleon loop (the exact form of these functions depends upon the method
of regularization, and it is not necessary to make this dependence precise here).
In this formula we supposed that the meson mass is known. This is without
justification since it is a function of the $C^i_N$. Let us, however, choose the $C^i_N$
constants in such a way that we have

(5.a4)  $a = -(l-1)/l\gamma$,  $b = -1/iG^2$.

Then

(5.a5)  $(NN) = \dfrac{iG^2}{(p^2 + \kappa^2)(1 - iG^2 R(p^2))}$.


We recognize here the interaction kernel produced by the exchange of a meson,
the propagator of which is modified in the first order in $G^2$ by the vacuum
polarization, in a Yukawa theory in which the meson is coupled with the
nucleon pair by the constant $2\pi G$. It is the Dyson function $\Delta'_F$ in which we
have omitted the terms proportional to powers of G higher than the second.
We denote this function as $\Delta'_1$.

Then, in the same way as for the photon (Chap. 9-A in V°), one can show
that, from the equality $(NN) = iG^2 \Delta'_1$ (see eq. (5.a5)), the value of an S
matrix element in R theory, represented by the sum of every graph formed
with certain open lines of nucleons and certain closed loops of nucleons having
more than two vertices but connected from one vertex to another by «chains»
having any number of closed loops with two vertices, of lepton and nucleon type,
is equal to the value of a corresponding S matrix element, in the Yukawa
theory, formed by the same open lines of nucleons and the same nucleon closed
loops having more than two vertices, but connected from vertex to vertex by
first order corrected meson lines having the value $\Delta'_1$. It results from this
theorem that, in the (5.a1) approximation, the R theory is identical to the
Yukawa theory ignoring the lepton fields.


β) A second way of treating the interactions. — Although the first method
is able to lead to the usual Yukawa theory, we think that this treatment is
not physically correct, because it implies the a priori knowledge of the meson
mass and it chooses the constants in order to obtain exactly the usual Yukawa
theory. Rather, let us write

(5.a6)  $(NN) = iA / U(p^2, g, C^1_N, C^2_N)$

and define the experimental constants (A and $\kappa^2$) by
Lia 70, I Ck; ty) =0-7?= *(g, Ie 9 Cy) i) 
(5.47) È U 


a Lpd Ck; Iw) 3 
Conversely, the regularization constants $C^i_N$ are
ae t i 
(5.a8) Gp == Cul, Gg) « 


To be able to generalise the equivalence theorem between the R-theory and
some Yukawa theory $Y_X$, as we have done in the preceding method, let us
define the free meson propagator $1/X(p^2)$ by the condition that the interaction
kernel between two nucleons, due to the exchange of this meson, the
propagation of which is corrected to the first order in $G^2$ by the vacuum
polarization terms conveniently normalized, be identical with the primitive
interaction kernel (NN) of the realistic theory


(5.09) fig = Re Pia di i 


From this there results 


(5.a10) AX (p?) = U(p?) + iG? Rp?) (p+ n°). 




The function $X(p^2)$ is generally not equal to $(p^2 + \kappa^2)$, because


| nee 4 (12 dk 
Sr (Ap?) eet 


(0409) iS 


is different from zero, except for particular values of the constants $C_i$ (those
of the preceding case) (*). Thus


(5.a12) $X(p^2) = (p^2 + \kappa^2)\,(1 + Z(p^2, \kappa^2))$, $Z(-\kappa^2, \kappa^2) = 0$,


and the Hamiltonian of the free meson is generally not a linear function of $p^2$.
Then the meson mass renormalization method is not suitable and we are
obliged to use the normalization. The charge renormalization can still be made
with the Dyson formalism. Also there results from the condition (5.a9) that the
Yukawa theory $Y_X$ in which the free meson equation is defined by


(5.a13) $X(p^2)\,\varphi(p) = 0$,


where $X(p^2)$ is given by (5.a10), is identical to the realistic theory. From this
identity, one obtains immediately a simple way to treat every effect deducible 


(*) See correction in the Note added at the end of the paper.
from the S matrix. The R-theory is then renormalizable, or normalizable,
according to the values that we give to the first regularization constants $C_i$.
If it is renormalizable, it is equivalent (in this approximation (5.a2)) to the 
usual Y-theory, and if not one sees that the difference from the usual Y-theory 
lies in the non-linearity of the meson equation, mostly affecting the results 
in the virtual meson processes. 


b) The meson theory of beta decays. — Nucleon-lepton and lepton-lepton
interactions exist when the functions $K_{L_i}$ are finite. A new approximation
of the R-theory is


(5.b1) VE; | > 1 and finite, wu = independent of p? for |p? < M%, 4 


We shall show that this approximation gives rise to the π-meson theory of
PS beta decay.

In this approximation the D function has only one pole with low mass. 
We interpret this pole as the π-meson. Taking for A, the same function as
in (5.a), one sees that 


(5.52) (= Ni, Ti SI (e), 


(Ga NES Lj K,, 


$G_{L_i}$ = coupling constant between the π-meson and the lepton pair $L_i$.


One obtains the experimental G constants for the coupling of the π-meson with
the (μν) and (eν) pairs by choosing


(5.53) yE,~5:10%, -yK,~1:5-10*. 


Evidently, in such an approximation, the Fermi interaction resulting from the
exchange of the π-meson alone is too weak, mainly the Fermi interaction between
the lepton pairs. This results from neglecting the $p^2$ dependence of the $K_{L_i}$
functions, which gives rise to new and heavy bosons more strongly coupled with
leptons and less with nucleons.

However, although this is a very crude approximation, one sees a law for
the cut-off appear which seems to have a profound significance: for the
pairs of a same family, the magnitude of the function $K_{L_i}$ is larger when the
mass of the pair is smaller. One can derive the behaviour of this law from a
theorem of Heisenberg (10) which implies the impossibility of having a local,
causal and covariant interaction between null mass particles. Then if we
postulate a Fermi coupling between two pairs of fermions, one having a null mass,
the interaction between two null mass fermions through the virtual pairs


(10) W. HEISENBERG: in L. de Broglie, physicien et penseur (Paris, 1953), p. 283.


(which is causal, local and covariant) must be null. This is the case only when
the null mass closed loops have an infinite value. Then the regularization
constants must be chosen so that they tend to infinity when the pair mass
tends to zero. This theorem applies quite generally and permits us


(i) to explain the photon structure, 
(ii) to forbid the two-neutrino decay of the π⁰-meson,
(iii) to show the impossibility of a local interaction between photons. 


6. — Generalization of the Realistic Theory. 


a) The renormalization. — From the equivalence theorem between the
R-theory and the $Y_X$-theory, we are led to normalize or renormalize according
to the general scheme of Sect. 3. Physically we preserve the properties of
mass and coupling constants of the new particles which result in the summing 
of a certain class of graphs. Mathematically, this preservation is equivalent
to the hypothesis that the poles and residues occurring in this sum of graphs 
must remain the same when we take into account the other graphs of the 
S matrix. However, it is not clear in this method why the known beta
interactions have almost the same value, which appears rather independent of the
exchanged energy. This point can be cleared up only by a better knowledge
of all the mesons through which this interaction occurs. Another way to re- 
normalize, given by KAMEFUCHI-UMEZAWA (*), starts, on the contrary, 
preserving the Fermi constant so that at zero energy the Fermi interaction is 
equal to the Fermi constant. However one does not understand the physical 
meaning of the terms introduced in the Lagrangian for renormalization. Fur- 
thermore, the authors are not concerned with the poles of the kernels, and we 
do not know if the corresponding masses are also renormalized de facto. In 
R-theory we obtain an equality of the Fermi interactions for low energy, by 
postulating that every y,,(p?) function has the same value when p? is null. 

In our method we have 2L arbitrary constants at our disposal if the g constant
is given for every coupled pair (L = number of unlike pairs); then there are
(at least) L mesons, each coupled with all the L pairs, so that we have L(L + 1)
experimental constants to determine the value of the 2L arbitrary constants.
Therefore, there must exist relations between the various mesons and one can hope
that a better knowledge of the light mesons will help us in research on the 
heavy mesons. As the R-theory gives a relation between the number of fer- 
mion pairs and the number of bosons, so the Y-theory, when it postulates many 
bosons of the same tensorial nature, must also postulate a corresponding number 
of fermion pairs to be renormalizable (*). 
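The counting argument above can be checked with a few lines of Python. This is only an illustrative sketch: the function name `constant_count` is ours, and it assumes, as the text states, L meson masses plus L² coupling constants on the experimental side and 2L arbitrary constants on the theoretical side.

```python
def constant_count(L):
    """Count constants for L unlike fermion pairs (illustrative helper).

    Returns (arbitrary, experimental, relations), where
    - arbitrary    = 2L regularization-type constants,
    - experimental = L masses + L*L couplings = L(L + 1),
    - relations    = experimental - arbitrary = L(L - 1).
    """
    arbitrary = 2 * L
    experimental = L + L * L
    relations = experimental - arbitrary
    return arbitrary, experimental, relations


for L in (1, 2, 3):
    print(L, constant_count(L))
```

For a single pair (L = 1) no relation is left over, while for L > 1 the number of required relations grows as L(L − 1).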


(*) These points are more precisely discussed in the Note at the end of the paper.
b) The fermion mesons. — In the same way as in the preceding sections,
one can also build a realistic theory for the complex structure of fermion 
mesons formed by a definite fusion of certain bosons and certain fermions 
supposed more elementary and given a priori. An example of such a meson 
appears in the problem of π-nucleon scattering, and is due to the pole
produced by the iteration of nucleon self-energy graphs. However, as in Sect. 4.d,
this pole can be interpreted as a new meson (here a hyperon, Y) only in the
S matrix formalism, which allows one to define the coupling form
$G_{\pi Y}\,(\bar\psi_Y O \psi_N)\,\Phi$.

The general argument that we have given about the bosons applies also
in this case, so that the mass of these new fermions must be complex. This
can happen either if the mass of the fermion meson is larger than the mass
of its constituents or if the constituents are also meson (complex) particles.
The problem of renormalization in this case has already been resolved by the
Dyson method. Apparently, the only difficult problem lies in the choice of
the most important fundamental particles and in a satisfactory treatment of
the strong pion-nucleon coupling. Thus one can hope that, if these rather long
calculations can be carried out, one will be able to construct by degrees all the
mesons by postulating some fundamental Fermi coupling between a very
restricted number of «most elementary» particles (electron, neutrino, proton
and perhaps neutron) (*).


c) The problem of four-field coupling (11).


α) The Fermi couplings. — In the main, we have shown that the Fermi
coupling is equivalent to a sum of Yukawa-Lorentz type couplings between the
given fermions and new boson particles (mesons), always unstable. There
appears also an additional coupling term between the meson wave functions
(except in the V° case), like the usual $\lambda\varphi^4$ term of the Y-theory introduced by MATTHEWS.

Let us now start from the same Lagrangian, and make a Fierz
transformation (12); then the coupled lepton and nucleon pairs will be reorganized
into mixed lepton-nucleon pairs coupled together with different coupling constants
and different tensor forms, giving a new form for the Lagrangian. The new
meson theory that one can deduce from this new Lagrangian must be identical
to the preceding one, and there must then exist relations between the new and old
types of mesons. For the coherence of the S matrix formalism, these new
mesons must have masses larger than the proton mass, so that the imaginary
part of the mass prevents the presence of such mesons at infinity.


(*) See Note added at the end of the paper.

(11) We are indebted to H. UMEZAWA for informing us that S. KAMEFUCHI has
published a paper on this subject (Prog. Theor. Phys., 11, 273 (1954)). This journal has
not yet arrived in Paris, and it seems from Phys. Abstracts, 10789 (1954), that this author
has not reached the physical conclusion given here.

(12) FIERZ: Zeits. f. Phys., 104, 553 (1937).
The hypothesis of such mesons has been introduced in the papers of
TANIKAWA and TANAKA (13). These mesons decay into a nucleon plus a lepton,
(n, p, Y, ...) + (e, ν, μ, ...), according to the value of their mass. These authors
showed also that scalar and pseudoscalar mesons of this type alone are
sufficient to deduce the Yang-Tiomno interactions. Then, according to our theory,
one can show that the S and PS Fermi couplings between the mixed pairs (nucleon
and lepton) quite suffice to explain the known charged interactions. The
R-theory allows us to investigate the existence and the properties of the
neutral Fermi couplings. Among the neutral Fermi couplings that experiment
indicates, the vectorial and PS ones certainly exist, and give rise to the photon
and the π⁰-meson (*). The existence of θ-mesons, which may be scalar, can be
explained in two different ways. The first way is to derive their existence from
S-Fermi couplings (charged and neutral) between the fermion pairs. In this
model the resulting meson will be called «primitive», like the π-mesons.

A second way to construct S mesons exists in the PS meson theory:


β) The Matthews term in PS meson theory. — Indeed, for the $\lambda\varphi^4$ term,
one can try to apply the same treatment that we have used for the Fermi
coupling. However, a new difficulty arises from the fact that one can iterate
the meson closed loops in every vertex in many different ways, because one
can group in pairs the four meson lines of a vertex in three different ways.
The same difficulty occurs also in the Fermi interaction when, instead of the
Lagrangian that we have considered, one uses the sum of the three different
Lagrangians obtained by Fierz transformations. However, due to the apparent
success of the iterative method in the Fermi interaction, we shall suppose that
the combination problem of iteration of graphs also has a solution in the
meson-meson interaction. Then, as in the Fermi coupling treatment, there
appear two arbitrary constants: the coupling constant λ, and the cut-off
constant in meson closed loops with two vertices. The pole and its residue
produced by the iteration method are interpreted as due to a new meson having
a certain mass and a coupling constant with the PS meson pair. This meson is
scalar and its structure is different from that of the S meson constructed with the
S-Fermi coupling. We call it a «secondary» meson and, to recall its PS-Fermi
coupling origin, we denote it $S_{PS}$.
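The statement that the four meson lines of a vertex can be grouped in pairs in three different ways is a small combinatorial fact that can be verified directly. The sketch below is ours (the helper `pairings` and the line labels are hypothetical); it simply enumerates all ways of splitting a list of lines into unordered pairs.

```python
def pairings(lines):
    """Return every way of grouping the given lines into unordered pairs."""
    if not lines:
        return [[]]
    first, rest = lines[0], lines[1:]
    result = []
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        # Pair the first line with one partner, then pair up what is left.
        for tail in pairings(remaining):
            result.append([(first, partner)] + tail)
    return result


# Four meson lines meeting at one vertex (labels are purely illustrative).
for grouping in pairings(["a", "b", "c", "d"]):
    print(grouping)
```

The three groupings printed for four lines are the analogue of the three Lagrangians related by Fierz transformations mentioned above.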

As an application, if the constant λ is large, there is no reason to neglect
it in the calculation of the nuclear potential. Its action is important as soon
as there are more than two mesons in flight; but then the most important
quantity is the pion-$S_{PS}$ coupling constant. In the same way this constant must
be introduced in the pion-nucleon scattering, before the usual iteration of
nonadiabatic methods.


(13) TANIKAWA: Prog. Theor. Phys., 3, 338 (1948); TANAKA: Prog. Theor. Phys., 9,
169 (1953).
As a conclusion of this rather imprecise study we want to point out the
fact that it seems generally possible to substitute for a coupling of more than
three fields an equivalent description, using additional field quantities and new
couplings containing no more than three fields.


7. — Conclusion. 


To summarize this paper, we have shown that, with the use of
regularization and a certain way of ordering and summing the matrix elements,
the Fermi coupling can be completely treated. The main result is the
interpretation of the method in terms of new particle concepts. Then we derive
from the existence of Fermi coupling between fermion pairs the properties
of boson mesons through which the Fermi interactions must occur. The
extension of this method and of the physical interpretation can be made, and
one reaches the conclusion that every meson, boson or fermion, can be built
from the smallest number of fundamental particles (electron, neutrino, proton,
and perhaps neutron), requiring no essentially new principle or new methods.
However, the use of regularization, which introduces in the theory arbitrary
constants to be determined by experiment, needs to be improved. It is in research on
the law of the regularization constants (whose qualitative form is given) and
in its further interpretation that one can make progress. Most of the problems
underlined in Sect. 6 require further work. We shall treat in our next
publications more precise examples of the applications of this theory:

— The PS theory (structure of mesons, bosons and fermions, charge
symmetry and independence of nuclear interactions).
— The electromagnetic properties of mesons (charge and photo-decays).
— The V, PV, T interactions (allowed meson coupling forms, gauge
invariances).


* * *
APPENDIX


Demonstration of formula (4.53). 


The kernel equations are (3.c5) 


a) o, 

b) (UV)=y+yY(UW)KE,=(VU)=y+yY (VW) Ka, 
i WAY has 

then 


(UU) = y[(UV)-K, + > (UW)-K,— (UU)-K,] 


y(UV)-K,+ ((UV)—y)—y(UU)-K,, from b) 
or 
(UV): (1 + yk.) = y + (UU) (1+ yK,) 


and by permutation of « and » 
(VO) =2 yk) = y (VV): (1 + yK,) . 
The product of these two equations is 


ee a (ley Te) (ee a) =O OU) (VV eB) (de yK) H+ 
and using da) 


[(UV)}:- (00) (VV) + yu) + ye) = + [XY (OW) + (VW) La] 


writing 
TTT Ni 
(OV) = —— for every « and v; 
D I 


2 { D =i Sal Nig ua Now) K ri > 
Na Car INDISSE — 93. o x D 
( ) ¥ | (ds yK,)(1 i + yK,) “| 


q.e.d. 


NOTE ADDED IN PROOF.


I meant to have a paper (R) published in Nuovo Cimento under the title
«A Realistic Theory of Mesons». Owing to an unexpected delay, two other
papers presented at the Pisa conference, viz.,

(E): Les Couplages de Fermi et la Théorie des Bosons
(F): L'exclusion des Fantômes




will be issued before the first paper (R). I therefore endeavoured to write both 
of them so that they might be thoroughly understood without precise reference 
to (R). 

I am going to deal with the alterations and generalizations that have occurred.


1. — On the mesons equations. 


Corrections and additions to Sect. 5a and 6a of (R). The paper (E) is a
general mathematical demonstration of the equivalence of the R-theory to
a certain Yukawa theory, when no real meson is present. This theorem was
indicated very briefly at the end of Sect. 5aα of (R). The application
of (E) permits us to show that the meson equations are linear functions of the
d'Alembertian, and to clarify their meaning and the methods of renormalization.


Aa) One pair; first order normalization. 


Let us first correct the wrong result of non-linearity of the meson equations
(R.5aβ), which comes from the fact that we did not utilise the explicit form
of the (NN) kernel. The equation (R.5a9) (which is the condition
$(UV^Y) = (UV^R)$ of (E)) has the form


(N1.1) $\dfrac{iG^2}{X(p^2)\,\bigl(1 - iG^2 K^R(p^2)/X(p^2)\bigr)} = \dfrac{\gamma'}{1 - \gamma' K_N(p^2)}$ .


We use here the notation $\gamma'$ for $(-i\gamma)$, and $K^R$ for the normalized (or
renormalized) expression for the closed loop $B^2$, such that $K^R(-\kappa^2) = 0$ and
$\bigl((\partial/\partial p^2)K^R(p^2)\bigr)_{p^2=-\kappa^2} = 0$. Note that the (NN) kernel is identical to
the kernel which would result from a direct coupling between nucleon pairs
with the coupling constant $g' = -i\gamma$. Expressing $K^R$ in terms of the regularized
expression for the closed loop $B^2$, the function $K_N$, it follows that


(N1.2) $K^R(p^2) = K_N(p^2) - K_N(-\kappa^2) - K'_N(-\kappa^2)\,(p^2 + \kappa^2)$ .


Then, from (N1.1) and (N1.2) one obtains


(N1.3) $\dfrac{X(p^2)}{iG^2} - K^R(p^2) = \dfrac{1}{\gamma'} - K_N(p^2)$ ,

(N1.4) $\dfrac{X(p^2)}{iG^2} = \dfrac{1}{\gamma'} + \bigl(K^R(p^2) - K_N(p^2)\bigr) = \dfrac{1}{\gamma'} - K_N(-\kappa^2) - K'_N(-\kappa^2)\,(p^2 + \kappa^2)$ .
yet 
$X(p^2)$ is a linear function of $p^2$; the constants $K'_N(-\kappa^2)$ and $K_N(-\kappa^2)$ are
infinite if one does not use regularization, and are arbitrary otherwise.
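The normalization conditions on the closed loop, and the reason why the arbitrary constants drop out of the normalized loop while leaving only a linear function of p², can be illustrated numerically. The following is a minimal sketch under our own assumptions: the toy loop function `toy_K`, the scale `M2` and the helper `normalized_loop` are hypothetical stand-ins and are not the loop integrals of the paper.

```python
import math

M2 = 10.0      # hypothetical heavy-mass scale of the toy loop
KAPPA2 = 1.0   # hypothetical meson mass squared


def toy_K(s, c0=0.0, c1=0.0):
    """Toy closed-loop function of s = p^2; c0 and c1 mimic arbitrary constants."""
    return math.log(1.0 + s / M2) + c0 + c1 * s


def toy_dK(s, c0=0.0, c1=0.0):
    """Derivative of toy_K with respect to s."""
    return 1.0 / (M2 + s) + c1


def normalized_loop(K, dK, kappa2):
    """Build K^R(s) = K(s) - K(-kappa2) - K'(-kappa2) * (s + kappa2)."""
    k0 = K(-kappa2)
    k1 = dK(-kappa2)

    def KR(s):
        return K(s) - k0 - k1 * (s + kappa2)

    return KR


KR_plain = normalized_loop(toy_K, toy_dK, KAPPA2)
KR_shifted = normalized_loop(
    lambda s: toy_K(s, 5.0, -3.0),
    lambda s: toy_dK(s, 5.0, -3.0),
    KAPPA2,
)

# The normalized loop vanishes at s = -kappa^2 and does not depend on c0, c1.
print(KR_plain(-KAPPA2), KR_plain(2.0), KR_shifted(2.0))
```

In this toy setting the difference K(s) − K^R(s) is exactly the linear function K(−κ²) + K′(−κ²)(s + κ²), which is the structure that makes X(p²) linear in p².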
Ab) One pair; infinite order normalization. 


One can generalize the previous result. Let us denote by $B_N^{(q)}$ the closed
loops of nucleons having two vertices from which external pairs emerge, and
q vertices between which internal pairs are exchanged; for example
[diagram not reproduced].


The sum of all the «chains», constituted with any number of such closed loops
$B_N^{(q)}$, defines a «General interaction kernel» (NN), which is a function of the
«General closed loop» $B_N = \sum_{q \ge 0} B_N^{(q)}$, the value of which is $\tilde K_N(p^2)$:

(N1.5) $(NN) = \dfrac{\gamma'}{1 - \gamma' \tilde K_N}$ .


Let us define also the «General normalized interaction kernel of the Yukawa
theory», which is $iG^2\Delta'_F$ {$\Delta'_F$ is Dyson's function,
$(\Delta'_F)^{-1} = X(p^2)\,\bigl(1 - iG^2 \tilde K^R/X(p^2)\bigr)$}. In the same way as before, one generalizes the
equivalence theorem (E) by determining the meson equations in such a way that


(N1.6) $iG^2\Delta'_F = (NN)$ .


It results also that the bare meson equation is a function of the infinite part
of the function $\tilde K_N$, and is linear.
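The summation of chains of closed loops into a kernel of the form coupling / (1 − coupling × loop) is a geometric series. The short sketch below checks this numerically for ordinary complex numbers; it is an illustration only, with hypothetical values for the coupling `g` and the loop value `K`, and it ignores the momentum dependence and the divergences treated in the text.

```python
def chain_partial_sum(g, K, n_loops):
    """Sum the chains containing 0, 1, ..., n_loops closed loops:
    g + g*K*g + g*K*g*K*g + ...
    """
    total = 0j
    term = complex(g)
    for _ in range(n_loops + 1):
        total += term
        term *= g * K
    return total


def chain_closed_form(g, K):
    """Closed form g / (1 - g*K) of the infinite chain sum (needs |g*K| < 1)."""
    return complex(g) / (1 - g * K)


g = 0.5            # hypothetical coupling
K = 0.8 + 0.2j     # hypothetical loop value at fixed p^2
print(chain_partial_sum(g, K, 200), chain_closed_form(g, K))
```

The closed form has a pole where 1 − gK vanishes; in the text it is the corresponding pole of the kernel, as a function of p², that is interpreted as a meson.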


Ba) L pairs; first order normalization. 


To demonstrate the equivalence theorem (E), one must first calculate
the first order normalized kernels of the Yukawa theory, $(UV^Y_1)$. To do this, one
chooses the arbitrary constants of the $K_{\mu\nu}$ functions in such a way that the poles
and residues of the $(UV^Y_1)$ are the same as those of the $(UV^Y_0)$. But one generally
has 2L arbitrary constants at one's disposal to satisfy L(L + 1) conditions (L poles
or meson masses, L² residues or coupling constants); therefore, unless there
exist L² + L − 2L = L(L − 1) relations between the meson constants, one
cannot normalize this theory to the first order. The R-theory gives these
relations, one of which is the relation between the coupling constants

$\sum_i G^i_\mu G^i_\nu = 0$ if $\mu \neq \nu$.

However, even if these relations are fulfilled in lower approximation,
they are not in higher approximations.
Bb) L pairs; infinite order normalization. 


Let us define as $B^q_{\mu\nu}$ the irreducible closed loops which have the following
properties:
— They have 2 vertices from which external pairs emerge; the pairs
carried by the enclosing line are of μ-type at one vertex and of ν-type at the other.
— They have q other vertices between which internal pairs are exchanged.
— The irreducibility is the following topological property: let $x_1$ and $x_2$
be the positions of the two «external» vertices in a two dimensional mapping;
then every open line crossing the loop $B^q_{\mu\nu}$ between these two points, and
cutting lines of the loop only at vertex points other than $x_1$ and $x_2$,
never crosses only one vertex.


Let us then define the functions $\tilde K_{\mu\nu}$, represented by the sum of all these
closed loops, $B_{\mu\nu} = \sum_q B^q_{\mu\nu}$; if μ = ν the smallest q is zero, and if μ ≠ ν the
smallest q is two, or three when Furry's theorem applies.
The «General interaction kernels in R-theory» are then functions of the
$\tilde K_{\mu\nu}$; in the same way as in (R.3c), one obtains
ONTO (UVa) =o Vuv =| (U Ga Wa Git , 


we denote here y,,= 191/27, Yu» being the coupling constant of the two pairs 
u and v (for an exaniple of this calculation in the case of 2 pairs, see: V° 
Chap. VITI, F). Let us now define the « General normalized interaction kernel 


x 


in the Yukawa theory », (UVz) by the formula 


n 


(N1.8) (Ue (Ur) AO WAI) 

in which, (UV}) denotes the «bare mesons interaction kernel » 

(N1.9) COV IGG Aap) Xp) = pas 
1 


and $\tilde K^R_{\mu\nu}$ the general normalized closed loop functions, obtained from $\tilde K_{\mu\nu}$.
Let us explain this calculation. If the general closed loop functions $\tilde K_{\mu\nu}$ have
the same types of divergences as the first order closed loops $K_{\mu\nu}$ used in previous
sections, they contain 2·L(L + 1)/2 arbitrary constants after a regularization
of these expressions as a whole. One can thus determine, in normalization,
all the constants, and this is equivalent to imposing two conditions on every
$\tilde K^R_{\mu\nu}$ function. As before, one has


~ 


(N1.10) Ko — Ky, LE db) = binomial Pi 


P,, 18 the infinite part of A, before regularization; after it, F,,, and Duy are 
arbitrary constants; P,, is a pure imaginary quantity. The condition 


(N1.11) (Given Eye 


determines the mesons of the equivalent Yukawa theory. After some
manipulation of the equations (N1.7), (N1.8) and (N1.11) one gets


“aa Wa} BY ° 


(N1.12) SESTA 


It results that the field equations of mesons are linear functions of joes ne 
bare meson field operator, JZ’ is defined by 
(N1.13) SEC eh OS UN aE ke 
; 3 
which gives the well known result when using the explicit form (N1.9). 

As a conclusion, let us remark that, if the series of approximations of the
unrenormalized closed loop expressions is strongly convergent, then the $P_{\mu\nu}$ (μ ≠ ν)
are smaller than the $P_{\mu\mu}$ (the first order $P_{\mu\nu}$ (μ ≠ ν) is of the same order of
magnitude as the second or third order $P_{\mu\mu}$). If one neglects them, one gets the
relations between the coupling constants and the masses of the various mesons
underlined in Ba. This will perhaps provide an experimental test of the
convergence of series expansions.


2. — On the Heisenberg theory of elementary particles (1). 


It is useful to discuss some common features and differences between the
Heisenberg theory and mine, since both are related to a realistic concept of
elementary particles. The Heisenberg way starts from the neutrino field alone,
non-linearly coupled with itself through a Fermi type coupling, and calculates
the various excited and compound states of this field (deductive way).

We use, on the contrary, an inductive approach: 

— In a first step, mostly developed in (V°) and (R), we determine the
Fermi couplings between the various fermions, supposed elementary, in such
a way that the resulting bosons are the experimentally known particles
(photon, pions, ...).

— The second step will consist, when we suppose given the bosons and 
the «most elementary » fermions, in constructing the fermions mesons, in 
such a way that they are coupled by the Fermi coupling determined in step 1. 


(1) I am now referring to a series of papers which were unfortunately unknown
to me when I sent paper (R) to be published: W. HEISENBERG: I - Nachr. d. Göttinger
Akad. Wiss., S. 111 (1953); II - Zeits. f. Naturf., 9a, 292 (1954); III - To be published
(in collaboration with F. KORTEL and H. MITTER). I am very indebted to Prof.
HEISENBERG for giving me a preprint of his last paper (III), which I mostly discuss here.


The solution of this problem must indicate what are (or is) the most elementary
fermion fields, and their coupling (ν, e, p, n, or ν alone?).

Other differences appear for the treatment of the equations. 

The Heisenberg equation and the associated method lead to finite results.
Every closed loop is finite, because it must always be calculated with the
use of a propagator S(p) very strongly regularized as a consequence of the
non-linear equation. The masses are calculated by using non-adiabatic methods
but, as pointed out in R.4d, Heisenberg is led to follow a way similar to the one we
used in (V°) and (R) to calculate the coupling constant (fine structure constant).
A phenomenon analogous to our «Deutrino» (see V°, Chap. XII, XIII)
appears also in his solution of the null mass boson problem (first term of
equations 67 and 68 of III).

Now for a synthesis of our two theories (2). Let us postulate that, in the
end, my way of approach will show that one can actually construct every
particle with the neutrino field alone, conveniently coupled with some Fermi
type couplings (according to the Heisenberg idea). In this model, one must
be able to show (step 2) that, starting from the neutrino field and various boson
fields given «a priori», one obtains all the fermions which are coupled according
to the Fermi coupling calculated in step 1. But this is equivalent to starting
from the general propagator of the neutrino field (more precisely from the fermion
interaction kernel between two events where two pairs interact), which contains
the various excited states of the neutrino.

The Fermi couplings that we have calculated in step 1, and that the step 2 
must explain, have the following properties: 

In all the cases when it was possible to compare two Fermi constants, they 
were found to have the same absolute value. This property has been indicated 
in (F) as resulting from the charged Fermi interactions measurements; but 
this is not the most convincing test, since the experiments give the value 
of the interactions and not the value of the coupling constants. The previous 
property results mostly 

a) from the equality of magnitude of electric charge of p and e; 
b) from the equality of magnitude of mesonic charge of p and n (*). 

In two cases these constants have opposite signs. 


(2) We give here some details about the arguments given in the paper (F).

(3) Let us remark here that the charge independence of nuclear forces, (Gon)? = 
= (Gi)? = — (G2,)(G3,) = 3(GÉ)?, that we have deduced in Compt. Rend. Ac. Sc., t. 
239, p. 1267 (1954) from the coupling 


> gt (Yn? wp) (Pry? Ya) + (pny? pn) = ¥ (Ppy>Yp)) (py y; ae 2° Py) } + @.C. 


t=Lept 


in using the approximation of (R.5a1), results only from the fact that the Fermi constants
of the pairs (nn) and (pp) have the same absolute value and opposite signs. The
comparison of the Fermi constants of neutral and charged couplings can be done by the
calculation of the masses of charged and neutral pions, when the numbers of charged
and neutral lepton pairs are better known.

It results from the last property that step 2 leads to the existence of
imaginary Fermi couplings (Ghost couplings), which appear necessarily
associated with the convenient couplings. Then, either one must exclude the
Heisenberg idea, or there must exist in nature some kind of «Ghost coupling
exclusion principle». I admitted in (F) the second eventuality. The
demonstration of this principle has to be formulated in accordance with the general
formalism of quantum mechanics. Some consequences of this principle are
investigated in (F).


3. — A new frame for mesons theory.


We shall briefly review the general restrictions imposed on the relations
between the various mesons, as they result from the papers (R) and (F). We
use a phenomenological classification of «absolute» (A) and «non absolute»
(N.A.) selection rules (S.R.).


a) The Lorentz group S.R. - (A).
b) The Isotopic spin S.R. - (N.A.).


A deeper reason for the occurrence of the isotopic spin quantum number has been
explained by the Fermi coupling (3): further calculations can now show the
limits of its validity.


c) The allowed meson couplings - (A).


If one admits the postulate that there exist only the five types of Fermi
couplings (S, V, T, A, PS) between certain fermions, then the resulting mesons
can have only a few types of couplings with the fermion pairs; for example
the PV coupling of PS mesons is excluded.


d) The Causality S.R. - (N.A.).


The cut-off law, though quantitatively arbitrary, is however restricted by
the causality condition (R.5b). This restriction is more stringent when the
particles resulting from the decay have null masses; however, this S.R. does not
apply when the interactions are non-local. An interesting example is given
by the μ-meson decay (4): one cannot have the process μ → e + B⁰ → e + (ν* + ν)
(B⁰ means some neutral boson), while, in the same order of the coupling constants,
the process μ → ν + B± → ν + (e + ν*) is allowed (B± are some charged mesons).
In high order processes, one can however obtain interactions between the
neutrino pair and the (eμ) pair, by exchanging two charged bosons, which give
non-local interactions. When the masses of the resulting particles are not null,
the causality S.R. is only qualitative. We think that one must attribute the
preference for the decay π → μ + ν over π → e + ν to a law of this type.
Everything goes as if there existed some kind of «stepping law», which favours
decays giving rise to the smallest loss of mass. Perhaps the Heisenberg theory
can explain this fact quantitatively.

(4) We are indebted to Dr. L. MICHEL for an interesting discussion on this point.

e) The Ghost S.R. - (N.A). 

We indicated in (F) a tentative application of this law in a simplified 
model, which shows also an illustration of a complicated interaction which 
weakens the S.R. It is also a problem to determine if these S.R. can be expressed 
as a consequence of some phenomenological new quantum number (Gell-Mann’s 
Strangeness, for example). 

f) The Charge conservation - (A). 


We introduce the charge conservation as a separate principle. But a realistic 
theory should be able to demonstrate this principle, in time. 


SUPPLEMENTO AL VOLUME II, SERIE X N. 4, 1955 


DEL NUOVO CIMENTO 2° Semestre 


Kinetics and mechanism of nucleophilic substitution reactions (*)


M. SIMONETTA 


Istituto di Chimica Industriale dell’ Università - Milano 


(received 19 January 1955)


Simple substitution reactions (in which only one bond is exchanged) can be
represented in general form as follows:


RX + Y = RY + X.


In this reaction an existing bond is broken and a new bond is formed.
A first distinction can be made according to the type of bond breaking:
we may have symmetric or homolytic fission, R·|·X,
or asymmetric or heterolytic fission, R|:X.

The first type is common in gas-phase reactions, while the second is
characteristic of many substitution reactions in solution. Heterolytic substitution
reactions are further divided into two classes, according to whether the electrons
of the bond being broken remain with R or with the substituted group X.
This depends on two factors: on whether the leaving group tends to separate
with or without the bond electrons, and on whether the substituting group
can use its electrons to attach itself to the nucleus of R or
use its nucleus to attach itself to the electrons of R. The two classes of reactions
are called nucleophilic and electrophilic respectively. We can then construct
the following scheme for substitution reactions.


a) Homolytic reactions: Y· + R·|·X → Y:R + X·


b) Heterolytic reactions:


1) nucleophilic (S_N): Y: + R|:X → Y:R + X:
2) electrophilic (S_E): Y + R:|X → Y:R + X


(*) Lecture given in Naples on 17 October 1954, on the occasion of the XLV Meeting
of the Società Italiana per il Progresso delle Scienze.
We are interested here in nucleophilic substitution reactions, denoted by
the symbol S_N. These reactions have been studied extensively by C. K.
INGOLD, E. D. HUGHES and their school; these authors have paid particular
attention to the case in which the substitution takes place at a saturated
carbon atom, as happens, for example, when R denotes an alkyl radical.
The existence of two possible mechanisms has been demonstrated:
a one-stage, or bimolecular, mechanism, S_N2, for example:


OH⁻ + R—Cl → HO^(−1/2)....R....Cl^(−1/2) → R—OH + Cl⁻


and a two-stage, or unimolecular, mechanism, S_N1, for example:


R—Cl ⇄ R^(δ+)....Cl^(δ−) ⇄ R⁺ + Cl⁻ (slow),
R⁺ + OH⁻ → R—OH (fast).


The introduction into R of groups able to release electrons to the reaction centre favours the $S_N1$ mechanism over the $S_N2$ mechanism; the opposite occurs with electron-attracting groups.

This fact has been brought out very clearly by the study of the hydrolysis of alkyl chlorides in alkaline solution, as appears from the following table, which shows, for the various groups R, the type of mechanism, the order of the reaction, and the change in rate on passing from one group to the next:


    mechanism             S_N2                      |   S_N1
    group (R)             CH3-       CH3-CH2-       |   (CH3)2CH-     (CH3)3C-
    order of reaction     2          2              |   1             1
    rate                  ----- decreases ----->    |   ----- increases ----->


It can be seen that the reaction rate passes through a minimum in the region of transition between the two mechanisms. The nature of the solvent, and especially its ionizing power, can also exert an appreciable influence on these reactions, since a high ionizing power facilitates an increase in the magnitude of the electric charges on passing from the initial state to the activated complex, hinders a decrease, and retards the distribution of a given charge.

These effects will be stronger when electric charges are created or destroyed than when existing charges are merely redistributed. Thus, in the examples given above for the two mechanisms $S_N2$ and $S_N1$, an increase in the ionizing power of the solvent will mildly retard the one-stage mechanism, while it will strongly accelerate the two-stage mechanism. Finally, the nature of the substituting agent will influence the mechanism.

If we examine a series of substitutions with different nucleophilic agents on the same compound and under the same conditions, we see that, starting with a reagent of high affinity for the carbon nucleus, with which the reaction proceeds by the $S_N2$ mechanism, and considering agents of progressively lower affinity, the rate of substitution decreases progressively until the $S_N1$ mechanism is reached; beyond this point the rate naturally remains constant.

In considering the experimental methods available for determining the mechanism actually followed in the various cases, two types of nucleophilic substitution must be distinguished: solvolytic and non-solvolytic. In non-solvolytic reactions the substituting group comes from a reagent present in small and determinable concentration. In general, then, second-order kinetics correspond to an $S_N2$ mechanism and first-order kinetics to an $S_N1$ mechanism; the observed order of the reaction therefore also determines the mechanism.
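
The kinetic criterion for non-solvolytic reactions can be illustrated with a minimal sketch. It assumes idealized rate laws (rate = k2 [RX][Y] for the one-stage mechanism, rate = k1 [RX] for the two-stage mechanism); the function names, concentrations and rate constants are illustrative assumptions, not data from the text.

```python
import math


def sn2_rate(k2: float, rx: float, y: float) -> float:
    """Idealized one-stage (bimolecular) rate law: rate = k2 [RX] [Y]."""
    return k2 * rx * y


def sn1_rate(k1: float, rx: float, y: float) -> float:
    """Idealized two-stage rate law: the slow ionization of RX sets the rate,
    so the nucleophile concentration y does not appear."""
    return k1 * rx


def order_in_nucleophile(rate_law, k: float, rx: float,
                         y_low: float, y_high: float) -> float:
    """Apparent order in Y: slope of log(rate) against log([Y])."""
    r_low = rate_law(k, rx, y_low)
    r_high = rate_law(k, rx, y_high)
    return (math.log(r_high) - math.log(r_low)) / (math.log(y_high) - math.log(y_low))


# Hypothetical concentrations (mole/l) and rate constants, for illustration only.
ORDER_SN2 = order_in_nucleophile(sn2_rate, k=1.0, rx=0.1, y_low=0.01, y_high=0.02)
ORDER_SN1 = order_in_nucleophile(sn1_rate, k=1.0, rx=0.1, y_low=0.01, y_high=0.02)
```

Doubling the nucleophile concentration doubles the rate under the first law and leaves it unchanged under the second, which is the distinction the observed reaction order is used to detect.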

In solvolytic reactions the substituting group comes from a substantial fraction of the solvent, which is present in essentially constant concentration throughout the course of the reaction. To a first approximation the reaction will be first order, whatever the mechanism. Other techniques are then used, such as studying the effect on the reaction rate of structural changes in the compound or in the substituting agent, of changes of solvent, or of added salts; or else a detailed study of the kinetic course of the substitution or of its stereochemical course. This last method has proved of great importance, since it has led to a rigorous solution of the problem of the Walden inversion. Specifically, the following two rules have been formulated:


    1) substitution by the $S_N2$ mechanism involves inversion of configuration, independently of any structural detail;


    2) the $S_N1$ mechanism, which passes through a carbonium ion, involves racemization, generally accompanied by an excess of inversion, unless a group is present that protects the configuration, in which case the configuration is predominantly retained.


Reactions of the $S_N1$ type have also been described as « polymolecular », since in some hydrolysis reactions several water molecules have been shown to take part in the activated complex of the first stage of the reaction, that of formation of the carbonium ion; these water molecules would serve to polarize the carbon-chlorine bond so as to facilitate its rupture. Recently, however, a similar participation of a certain number of solvent molecules in the formation of the activated complex has been found also for a reaction with an $S_N2$ mechanism, namely the hydrolysis of 1-chloro-2-nitroethane in water-dioxane, water-acetone and water-formic acid mixtures.

The theory of aliphatic nucleophilic substitution reactions, summarized here in its most general outlines, has so far remained at an exclusively qualitative stage. This is due in part to the limited application that the methods of quantum mechanics have so far found to aliphatic compounds.

Turning to aromatic nucleophilic substitution reactions, while the fundamental ideas of the theory for aliphatic compounds remain valid, we shall see that a semi-quantitative treatment of the problem can be reached by applying the molecular orbital method with suitable simplifications. With few exceptions, of which the most conspicuous example is the decomposition of certain diazonium ions, aromatic nucleophilic substitutions take place by the one-stage mechanism, $S_N2$, and we shall therefore limit our considerations to this case. The changes in free energy during the course of the reaction can be represented by a curve such as that of Fig. 1.

Fig. 1. - Curve of the change in energy of the $\pi$ electrons during the course of the reaction (abscissa: course of the reaction).

In this curve point a corresponds to the initial state, that is to the separated reagents, e to the separated reaction products, and c to the activated complex. The free energy of activation, measured by the difference in level between c and a, can be written in the form


    $\Delta F = \Delta\pi + \Delta X - T\,\Delta S$,


where $\Delta\pi$ denotes the difference in the energies of the $\pi$ electrons, $\Delta X$ the difference in the energies of the $\sigma$ electrons, in the thermal and zero-point energy (due to the motion of the nuclei) and in the solvation energies, and $\Delta S$ is the entropy of activation. If we now reduce our problem to comparing the reactivity of two aromatic compounds 1 and 2 of similar structure, or of two positions 1 and 2 of the same compound, the differences in the other terms become negligible compared with the differences in $\Delta\pi$, and therefore instead of comparing $\Delta F_1$ and $\Delta F_2$ we can compare $\Delta\pi_1$ and $\Delta\pi_2$: that is, in the figure we can replace the ordinate $F$ by the ordinate $\pi$.
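
The step from free energies to $\pi$-electron energies can be written out explicitly. The following is a sketch under the stated assumption that the $\Delta X$ and $T\,\Delta S$ terms are the same for the two compounds or positions compared; the link to the ratio of rate constants $k_1/k_2$ uses the usual transition-state expression, which the text does not state and is added here as an assumption.

```latex
\Delta F_1 - \Delta F_2
  = (\Delta\pi_1 - \Delta\pi_2) + (\Delta X_1 - \Delta X_2) - T(\Delta S_1 - \Delta S_2)
  \approx \Delta\pi_1 - \Delta\pi_2 ,
\qquad
\frac{k_1}{k_2}
  = \exp\!\left(-\frac{\Delta F_1 - \Delta F_2}{RT}\right)
  \approx \exp\!\left(-\frac{\Delta\pi_1 - \Delta\pi_2}{RT}\right).
```

On this reading, the compound or position with the smaller $\Delta\pi$ is the more reactive one.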

As regards stage a, we know that the molecular orbital (M.O.) method, in its approximation as a linear combination of atomic orbitals (L.C.A.O.), allows at least an approximate calculation of the energy of the $\pi$ electrons in an aromatic molecule and of the electron density at the various nuclei; for stage c, however, the situation is different and some further simplification must be introduced. Two simplified methods are currently in use: the isolated-molecule method, which replaces stage c by stage b (corresponding to the first approach of the reagent to the aromatic molecule), and the localization method, which replaces stage c by stage d (that is, replaces the activated complex by an ideal model accessible to our calculations).

In the isolated-molecule approximation, the reactivity of a position in a reaction with an ion is taken to be determined by the density of $\pi$ electrons present at that position at the moment of reaction. If the reagent is a negative ion (as is usually the case in nucleophilic substitutions in aromatic nuclei), the $\pi$ electrons of the conjugated system will tend to move away from the ion as it approaches. Theoretically the same effect can be obtained by an increase in the Coulomb integral of the atom at the position attacked: one can therefore assume that the effect produced by the approach of the ion is an increase in the Coulomb integral of the atom which the ion approaches. From the general theory of the molecular orbital method it is known that $\delta E \approx q_r\,\delta\alpha_r$, where $q_r$ is the $\pi$-electron density at atom $r$ and $\alpha_r$ is the Coulomb integral for the same atom. Since $\delta\alpha_r$ is positive, the smallest increase in $E$ will occur for the smallest value of $q_r$; that is, the smaller the charge density $q_r$ at the site of substitution, the greater the initial nucleophilic reactivity at that position.

In the approximation of the localization theory, the model taken for the activated complex is an ion in which the reagent is attached by a covalent $\sigma$ bond to the activated carbon atom, while the group or atom to be replaced is still attached to the same carbon atom by a covalent $\sigma$ bond (in reality partial bonds should be assumed). For example, in the case of chlorobenzene attacked by a hydroxyl ion, the model of the activated complex will be the following:

[Structural formula not reproduced in the source.]

The carbon atom at which substitution occurs, originally in a state of $sp^2$ hybridization, is assumed to be in a state close to $sp^3$ hybridization, and no longer forms part of the conjugated system. The difference in reactivity between similar molecules, or between the different positions of one molecule, is given in this approximation by the difference in the change of energy of the $\pi$-electron system between the model assumed for the activated complex (position d on the curve of Fig. 1) and the initial state (position a).

Both theories have been tested repeatedly on various benzene derivatives, always with positive results. Recently the method has also been extended to naphthalene derivatives. Specifically, the rates of reaction of 1-chloro-2-nitronaphthalene (A), 1-chloro-4-nitronaphthalene (B) and 1-nitro-2-chloronaphthalene (C) with potassium hydroxide or with sodium methoxide or ethoxide were determined in various solvents and at different temperatures. In every case the order of reactivity was found to be: A > B > C.

The isolated-molecule method gave the following results: in 1-nitronaphthalene $q = 0.792$ (in electronic units) at position 2 and $q = 0.632$ at position 4; in 2-nitronaphthalene $q = 0.566$ at position 1 (in carrying out the calculation, the values used by WHELAND and PAULING for nitrobenzene were adopted for the necessary parameters, and conjugation with the $p$ electrons of chlorine and the inductive effect of this atom were neglected). Similarly, the localization method gave the following results:


    for A, $\Delta W = -0.997\,\beta$ (in units of $\beta = -20$ kcal/mole);
    for B, $\Delta W = -1.168\,\beta$; for C, $\Delta W = -1.246\,\beta$.
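
The two sets of theoretical indices can be compared with the observed order A > B > C in a short sketch. Two points are assumptions made here and not statements of the source: the assignment of each density $q$ to A, B, C is inferred from the compound names, and the localization energies for B and C are read from a damaged line as 1.168 and 1.246 in units of $\beta$. All function and variable names are illustrative.

```python
BETA_KCAL = -20.0  # kcal/mole per unit of beta, as quoted in the text

# pi-electron density at the carbon carrying the chlorine (isolated-molecule method).
# The mapping onto A, B, C is inferred from the compound names (assumption).
Q_DENSITY = {"A": 0.566, "B": 0.632, "C": 0.792}

# Localization energies in units of beta; B and C are readings of a damaged line.
DELTA_W_BETA = {"A": -0.997, "B": -1.168, "C": -1.246}


def rank_by_density(q):
    """Lower pi density at the attacked carbon means higher predicted reactivity."""
    return sorted(q, key=q.get)


def rank_by_localization(dw_beta, beta_kcal=BETA_KCAL):
    """Smaller localization cost (kcal/mole) means higher predicted reactivity."""
    cost = {name: value * beta_kcal for name, value in dw_beta.items()}
    return sorted(cost, key=cost.get)


ORDER_Q = rank_by_density(Q_DENSITY)
ORDER_W = rank_by_localization(DELTA_W_BETA)
```

Under these assumptions both rankings come out in the same sequence as the experimental reactivities.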


As the graph of Fig. 2 clearly shows, the experimental and theoretical results are in excellent agreement.

Fig. 2. - Qualitative shape of the potential energy curves: A = 1-nitro-2-chloronaphthalene; B = 1-nitro-4-chloronaphthalene; C = 2-nitro-1-chloronaphthalene.

A few months ago it was discovered that aromatic substitution reactions by a nucleophilic agent can also take place by a different mechanism, which we may call elimination-addition. J. D. ROBERTS, in 1953, published a short note on the basic amination of aryl halides, pointing out that these reactions are often accompanied by « rearrangement » processes; for example, ortho- or meta-chloroanisole gives meta-anisidine, while the para compound gives a mixture of the para and meta derivatives. Moreover, these reactions do not take place unless there is at least one hydrogen atom ortho to the halogen to be replaced. These facts suggested a dehydrohalogenation process brought about by the $\mathrm{NH_2^-}$ ions. ROBERTS carried out the same reaction on chlorobenzene containing a $^{14}$C atom (the one to which the chlorine is attached). Working with potassium amide in liquid ammonia at $-33$ °C, the chlorine is replaced very rapidly, and the aniline so obtained has the amino group attached 50% to the labelled carbon atom and 50% in the position ortho to that atom.

C. K. INGOLD, partly on the basis of considerations on the excited states of the benzene and acetylene molecules, has formulated the following scheme for the reaction:

[Reaction scheme not recoverable from the source.]

with the intermediate formation of a « benzyne ».
Having discussed aliphatic and aromatic nucleophilic substitution reactions, let us now consider a case that is in a certain sense intermediate: that of nucleophilic substitution reactions in the side chains of benzene derivatives. For reactions of compounds of this type HAMMETT formulated, on purely empirical grounds, a theory for which JAFFÉ has recently proposed an interpretation on quantum-mechanical grounds. This theory concerns reactions in the side chains of benzene derivatives in general, and here its application to the particular case of nucleophilic substitution reactions will be considered. According to HAMMETT, in the case of para- and meta-substituted benzene derivatives, if one compares two series of equilibrium or rate constants pertaining to reactions that differ either in the nature of the reacting group or in the change it undergoes as a result of the reaction, one finds that a simple quantitative relation exists between these constants. The same holds when the members of a single series of reactions are compared with one another. (The members of a series are identical as regards the reaction, but differ in the groups substituted meta or para to the reacting group.)

One series may refer, for example, to the ionization constants of meta- or para-substituted benzoic acids; another to the rate constants of hydrolysis of esters of the same acids. If the constants of one series are plotted on a logarithmic diagram against the constants of the other, the representative points are found to lie to a close approximation on a straight line. Points representing aliphatic or ortho-substituted benzene derivatives, on the other hand, deviate markedly from this line and are scattered at random over the diagram. For the straight line one can write the equation:


    (1)    $\log K_1 = \rho \log K_i + A$,


where $K_1$ is the rate constant of hydrolysis of the esters, $K_i$ the equilibrium constant of the acid dissociation, and $\rho$ and $A$ are constants.

A similar linear relation holds for practically all side-chain reactions of meta- and para-substituted benzene derivatives, and in particular for nucleophilic substitution reactions.

Equation (1) can be transformed as follows:


    $\log K = \rho(\log K_i - \log K_i^0) + (A + \rho \log K_i^0)$,


where $K$ is a generic rate or equilibrium constant and $K_i^0$ is the ionization constant of benzoic acid. Necessarily $A + \rho \log K_i^0$ is equal to $\log K^0$, where $K^0$ is the corresponding rate or equilibrium constant for the unsubstituted molecule. Defining then $\sigma = \log K_i - \log K_i^0$, we have:

    (2)    $\log K - \log K^0 = \rho\sigma$.


The constant $\sigma$ is by definition fixed by the substituent and is independent of the reaction to which the constant $K$ refers; the reaction constant $\rho$ is the same for all substituents and depends only on the reaction under study. For all substituents for which the ionization constant of the corresponding substituted benzoic acid is known with sufficient accuracy, the value of $\sigma$ follows immediately from the definition itself. For any reaction for which data are available for a sufficient number of meta- and para-substituted derivatives, $\rho$ is calculated by plotting $\log K$ against $\sigma$; with the values of $\rho$ so obtained, further values of $\sigma$ not directly obtainable can be calculated. HAMMETT has reported tables of values of $\sigma$ for the principal meta and para substituents and of values of $\rho$ for over fifty series of reactions.

Since the effect of a substituent is essentially to produce different changes of electronic charge on the different carbon atoms of the benzene ring, it is logical to look for a connection between the values of HAMMETT's $\sigma$ constants and the electron densities at the various points of the molecule. The calculation of electron densities in substituted benzene nuclei was carried out theoretically by WHELAND and PAULING, applying the molecular orbital method in its standard approximation of a linear combination of atomic orbitals. The difficulties inherent in applying this method of calculation to compounds containing heteroatoms are essentially those of assigning suitable values to the Coulomb integrals of the heteroatoms present in the molecule and to the carbon-heteroatom resonance integrals, and of taking into account the influence of the heteroatom on the neighbouring carbon atoms. These difficulties were overcome by WHELAND and PAULING, and by the authors who subsequently dealt with this problem, by assigning empirical values to the quantities in question. It was later shown that most of the calculations carried out were not sensitive to the choice of the values assigned to these parameters. A quantity extremely sensitive to these values has recently been identified by JAFFÉ in the increments of the electron densities at the positions para and meta to the substituent, and above all in their ratio.

Let us consider a compound having a substituent X in the meta or para position with respect to a side chain Y. The effect of the substituent X, meta or para to Y, on a reaction of the side chain Y is presumably the result of the influence of X on the electron density of the reactive centre. The substituent modifies the distribution of the $\pi$ electrons in the benzene ring; the change of charge on the carbon atom to which the chain is attached is transmitted to the reactive centre in a way that depends on the nature of the side chain Y. The general validity of equation (1) shows that the change produced by X at the carbon atom to which the chain is attached is essentially independent of the nature of Y. (Cases in which Y forms part of the resonance system are of course excluded.) It can therefore be assumed that a similar change of electron density will be produced by X in the corresponding molecule in which Y is replaced by a hydrogen atom. It follows that the increments of electron density at the meta and para positions of monosubstituted benzenes should be a monotonic function of the corresponding values of $\sigma$; and to a good approximation this function can be reduced to a direct proportionality between the values of $\sigma$ and the increments of electron density.

JAFFÉ has carried out the calculations needed to determine the values of the parameters for several monosubstituted benzene derivatives of the simplest type, namely those that contribute a single $\pi$ orbital to the resonance system. For the ortho positions the electron density is calculated correctly with the same parameters; the values of the rate constants, however, are affected by steric effects, which must be taken into account at least qualitatively if these derivatives are to be included in the theory. The success of JAFFÉ's work demonstrates the possibility of connecting the electron densities in monosubstituted benzenes with HAMMETT's $\sigma$ values, at least in cases where the change in these densities produced by the substituent is small, and is a good guarantee that the fundamental hypothesis of a monotonic dependence of the $\sigma$ constants on the electron densities is essentially correct. Specifically, JAFFÉ has established the relation $\sigma = -15.5\,\Delta q$, so that, for a reaction for which the value of $\rho$ has been established experimentally by means of other derivatives, it is possible to predict the value of the rate constant from the value of $\Delta q$ calculated by the M.O. method for a new substituent.

A further confirmation of the relation between HAMMETT's $\sigma$ values and the electron density at the carbon atom to which the chain is attached has recently been obtained by GUTOWSKY and co-workers, who measured the shielding of the magnetic moment of the $^{19}$F nucleus in fluorobenzene and in some substituted fluorobenzenes and established the existence of a relation between their shielding constants $\delta$ and the $\sigma$ constants. The relation between the values of $\sigma$ and the electron density is thus confirmed by the results of kinetic measurements, of magnetic measurements and of quantum-mechanical calculations.

As an example of the application of the Hammett-Jaffé theory to nucleophilic substitution reactions, let us examine the hydrolysis of benzyl chloride and of its ortho-, meta- and para-methoxy derivatives in water-acetone mixtures. The electron densities at the various positions of anisole are:


    ortho: 1.036,    meta: 0.992,    para: 1.017.


Moreover,

    $\sigma_m = 0.115$;    $\sigma_p = -0.268$.
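
The relation $\sigma = -15.5\,\Delta q$ can be checked against these figures in a minimal sketch. It assumes that $\Delta q$ is measured from a density of 1.000 in unsubstituted benzene, and it reads the damaged line above as $\sigma_m = 0.115$ and $\sigma_p = -0.268$; both are assumptions, and the names used are illustrative.

```python
JAFFE_SLOPE = -15.5   # sigma = -15.5 * delta_q, relation quoted in the text
Q_BENZENE = 1.0       # assumed reference density for unsubstituted benzene

ANISOLE_Q = {"meta": 0.992, "para": 1.017}          # densities quoted in the text
HAMMETT_SIGMA = {"meta": 0.115, "para": -0.268}     # as read from the damaged line


def sigma_from_density(q: float, q_ref: float = Q_BENZENE) -> float:
    """Substituent constant predicted from the pi-electron density increment."""
    return JAFFE_SLOPE * (q - q_ref)


SIGMA_CALC = {pos: sigma_from_density(q) for pos, q in ANISOLE_Q.items()}
DEVIATION = {pos: SIGMA_CALC[pos] - HAMMETT_SIGMA[pos] for pos in ANISOLE_Q}
```

With these inputs the calculated constants are 0.124 (meta) and -0.2635 (para), each within 0.01 of the tabulated value. The ortho position is left out because, as the text notes, steric effects intervene there.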


The solvolytic reactions, all strictly first order, proceed by the $S_N1$ mechanism; the reactivity of the various compounds decreases in the order:


    para > ortho > (unsubstituted) > meta,


that is, in the same order in which the electron densities at the carbon atom bearing the side chain decrease, except for the ortho/para inversion due to steric effects.

Turning to β-phenylethyl chloride and its methoxy derivatives, the rate constant $k_2$ of the reaction with KOH in water-alcohol solution has been determined. The values of $k_2$ (in h$^{-1}$ mole$^{-1}$ l) are summarized in the following table:

                      60 °C      70 °C
    unsubstituted     0.340      1.080
    ortho             0.0725     0.224
    meta              0.510      [illegible]
    para              0.125      0.370


In this case the reactions are second order and the mechanism is $S_N2$ (in reality the elimination reaction by the $E2$ mechanism, with formation of olefin, is also present); the data, in excellent agreement with the theory, allow $\rho$ to be evaluated graphically (Fig. 3).

Fig. 3. - Plot of $\log k_2$ against $\sigma$ for the reaction between the methoxyphenylethyl chlorides and potassium hydroxide in aqueous alcohol (ordinate: $-\log(100K)$). [The numerical values in the caption are illegible in the source.]
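
The graphical evaluation of $\rho$ can be imitated numerically. The sketch below is not the author's procedure: it fits a least-squares straight line of $\log k_2$ against $\sigma$ for the 60 °C constants quoted in the text, taking $\sigma = 0$ for the unsubstituted compound and reading the methoxy constants as $\sigma_m = 0.115$, $\sigma_p = -0.268$ (an assumption, since that line is damaged). The ortho derivative is omitted because of its steric effect.

```python
import math

# Substituent constants (assumed reading) and k2 at 60 deg C from the table in the text.
SIGMA = {"unsubstituted": 0.0, "meta": 0.115, "para": -0.268}
K2_60C = {"unsubstituted": 0.340, "meta": 0.510, "para": 0.125}


def fit_rho(sigma, k):
    """Least-squares slope and intercept of log10(k) against sigma.

    The slope is the reaction constant rho of equation (2);
    the intercept is the fitted log10(k) of the unsubstituted compound.
    """
    names = list(sigma)
    xs = [sigma[name] for name in names]
    ys = [math.log10(k[name]) for name in names]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    rho = sxy / sxx
    return rho, y_mean - rho * x_mean


RHO_60C, LOG_K0_60C = fit_rho(SIGMA, K2_60C)
```

With only three points this is an indication rather than a determination; on these inputs the slope is positive and of the order of 1.6, meaning that electron-releasing substituents slow the second-order reaction.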

If we consider the substitution reaction in the absence of KOH, we find at 95 °C the following values of $k_1$ (in h$^{-1}$, because the reaction is first order): phenylethyl chloride 0.0034, para-methoxyphenylethyl chloride 0.0105. It can be seen that on passing from the $S_N2$ mechanism to the $S_N1$ mechanism the order of reactivity of the two compounds is inverted.

There are, further, reactions whose kinetic behaviour does not fit any of the schemes already mentioned, because certain factors that generally are of secondary importance, and are therefore neglected in the schematizations adopted, take on a predominant role and so determine the kinetic course of the reaction. As a first example of these special reactions we shall cite the case of 1,3 interactions, or of so-called homoallylic resonance.

It has been observed that cholesteryl derivatives have a remarkably high rate of ionization, attributable to the delocalization of the electron cloud of the olefinic group adjacent to the reaction centre in the rate-determining stage. This participation of the 5,6 olefinic group in the substitution process gives rise to an ion II, which can be written as resonating between the structures IIa and IIb, with a partial 3,5 bond and a weakening of the 5,6 bond. This intermediate species reacts with nucleophilic species at C$_6$ to give 3,5-cyclosteroids (III) or at C$_3$ to give cholesteryl derivatives (IV).

The term homoallylic was derived from the analogy with the case of the allyl ion:

[Structural formulae not recoverable from the source.]

compared with the allyl ion, a $\mathrm{-CH_2-}$ group is interposed between the cationic centre and the $\pi$-electron system. For some simple cases, such as the systems V, VI and VII

[Structural formulae V, VI, VII not recoverable from the source.]

it has been possible to calculate the stabilization energy belonging to the most stable configuration, in which there is a balance between the tendency of the cationic centre to approach the olefinic system, with a consequent increase in resonance energy, and the resistance to a decrease in the angle between the $\sigma$ bonds joining carbon atoms 1,2 and 2,3, with a consequent decrease in the energy of these bonds.

The values found for this energy are 6.2, 9.6 and 4.2 kcal/mole for V, VI and VII respectively. The order of stabilization dienyl > vinyl > phenyl found for these groups in the β position is the same as for the ordinary conjugation of these groups with an olefinic bond. What is of most interest at present, however, is the order of the rate-enhancing effects of the β substituents in solvolysis reactions. For example, the rates of solvolysis of the para-toluenesulfonates of cholestanyl VIII, cholesteryl IX and dehydrocholesteryl X are in the ratio 1:100:3000. These values correspond to a stabilization of the transition states for IX and X of about 3 and 5 kcal/mole, even though the spatial orientation is not ideal for the overlap of the $\pi$ orbitals.
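
The conversion from rate ratios to transition-state stabilization can be sketched as follows. The calculation assumes that the whole rate ratio reflects a difference in activation free energy, $\Delta\Delta F = RT \ln(k/k_0)$, and that the temperature is 298.15 K; the text does not give the solvolysis temperature, so that default is purely an assumption.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal / (mole K)


def stabilization_kcal(rate_ratio: float, temperature_k: float = 298.15) -> float:
    """Free-energy lowering of the transition state implied by a rate ratio.

    temperature_k is an assumed value; the source does not state it.
    """
    return R_KCAL * temperature_k * math.log(rate_ratio)


# Rate ratios relative to the cholestanyl ester VIII, as quoted in the text.
DDG_IX = stabilization_kcal(100.0)    # cholesteryl ester IX
DDG_X = stabilization_kcal(3000.0)    # dehydrocholesteryl ester X
```

At the assumed temperature the two values fall a little below 3 and 5 kcal/mole respectively; a somewhat higher temperature brings them closer to the rounded figures quoted.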

Another special case that we shall discuss is the replacement of chlorine by iodine in phenylethyl derivatives. CONANT and KIRNER determined the second-order rate constants for the substitution reactions with the $S_N2$ mechanism:


    R-Cl + KI = R-I + KCl


for the compounds $\mathrm{C_6H_5(CH_2)}_n\mathrm{Cl}$ with $n$ ranging from 1 to 7. The values of the constants fall sharply on passing from the compound with $n = 1$ to that with $n = 2$; they rise somewhat for $n = 3$, fall again for $n = 4$, and then remain constant (and equal to the value found for normal-chain aliphatic derivatives) as $n$ increases. Recently the kinetic study of this reaction has been extended to the ortho-, meta- and para-methoxy derivatives of benzyl chloride, β-phenylethyl chloride and γ-phenylpropyl chloride. It was found that an increase in electron density at the reaction centre favours the reaction in the case of the benzyl and phenylpropyl derivatives, while it retards it in the case of the phenylethyl derivatives, and conversely for a decrease in that density.

As is well known, both effects are to be expected in reactions with the $S_N2$ mechanism; the different type of behaviour in the individual cases can be attributed to the fact that the major contribution to the actual value of the activation energy may come either from the process of partial rupture of the existing bond or from the process of partial formation of the new bond.

Fig. 4. - Logarithms of the experimental (×) and ideal (○) rate constants as a function of the number of carbon atoms in the side chain (ordinate: $\log K_2$).

These considerations, and the structural analogies between this case and the analogous example of the decrease in reactivity on passing from tertiary butyl chloride to neopentyl chloride, lead one to think that the cause of the sharp decrease in rate is to be sought in steric phenomena. One can moreover calculate approximately, by a method due to INGOLD, HUGHES and DOSTROVSKY, the increment of energy produced by steric hindrance on passing from the initial compound to the activated complex in the case of the benzyl, phenylethyl and phenylpropyl derivatives: the results are respectively 0.2 kcal/mole, 1.4 kcal/mole and 0.9 kcal/mole (this last value remains constant for the higher homologues of the series).
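
The effect of these steric increments on the rate constants can be estimated with a short sketch. It assumes that the steric increment enters only through the activation energy, so that a constant freed of steric hindrance would be larger by the factor $\exp(\Delta E_{steric}/RT)$, and it assumes a temperature of 323.15 K, which the text does not state. The names are illustrative, and the source's own calculation may differ.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal / (mole K)

# Steric increments of activation energy quoted in the text (kcal/mole).
STERIC_KCAL = {"benzyl": 0.2, "phenylethyl": 1.4, "phenylpropyl": 0.9}


def ideal_over_experimental(steric_kcal: float, temperature_k: float = 323.15) -> float:
    """Factor by which the rate constant would rise without steric hindrance.

    temperature_k is an assumed value; the source does not state it.
    """
    return math.exp(steric_kcal / (R_KCAL * temperature_k))


def ideal_log_k(log_k_exp: float, steric_kcal: float,
                temperature_k: float = 323.15) -> float:
    """Shift an experimental log10(k) by the steric term."""
    return log_k_exp + steric_kcal / (math.log(10) * R_KCAL * temperature_k)


FACTORS = {name: ideal_over_experimental(e) for name, e in STERIC_KCAL.items()}
```

The correction is largest for the phenylethyl derivative, which is the compound whose experimental constant shows the sharp drop.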
It is now possible, by subtracting the « steric » part from the total activation energies, to calculate the « ideal » rate constants of the reaction, that is to say the values these constants would have if it were possible to abolish the compression forces between non-bonded atoms which, in the activated complex, come too close to one another. The results appear in Fig. 4, from which it can be seen that, as one moves further from the phenyl group, the reactivity of the chlorine decreases regularly until it reaches the constant value of the aliphatic derivatives. A consistent explanation of the apparent anomaly shown by the experimental constants is thus reached.


1. - The Question of « Hidden Variables » in Quantum Mechanics.


Since the very earliest period, the physical interpretation of quantum mechan- 
ics has been a subject of controversy. As the discussion progressed, it grad- 
ually became centered around the conflict between two well-defined opposite 
points of view (1-5). Although hitherto neither has obtained a clear-cut logical
victory, most physicists have tended to resolve their doubts in favor of one 


(*) Research of this author supported in part by U. S. Office of Naval Research. 

(1) The history of the controversy is well summarized (with references) in essays
by N. BOHR (pp. 201-241) and A. EINSTEIN (pp. 665-688) in Albert Einstein: Philosopher-
Scientist (Evanston, Illinois (1949)).

(2) A. EINSTEIN, B. PODOLSKY and N. ROSEN: Phys. Rev., 47, 777 (1935).

(3) N. BOHR: Phys. Rev., 48, 696 (1935).

(4) W. FURRY: Phys. Rev., 49, 393 (1936).

(5) A. EINSTEIN: Physics and Reality, § 5, in Journ. Frank. Inst., 221, 313 (1936),
reprinted in A. EINSTEIN: Out of My Later Years (New York, 1950).

of them. We shall give a résumé of this discussion, as the most convenient
way of defining the terms needed for presenting our theory (6).

The more generally accepted view has been that identified mainly with 
NIELS BOHR. BOHR adapts the physical picture to the schematic experimental
situation, excluding any physical concepts that do not correspond strictly to 
experimental elements, even if there is no actual disagreement involved. Thus, 
in a situation in which the position of a particle is sharply defined, the very 
concept of momentum is excluded, since the wave function displays a wide 
dispersion of momentum values. 

The principal advocate of the opposing view has been ALBERT EINSTEIN. 
To use Einstein’s characterization, quantum mechanics (which may be con- 
sidered a formal expression of Bohr’s point of view) is not a « complete theory »; 
elements of physical reality, such as momentum in the above example, may 
exist and have definite values, even though unsharply defined according to 
the wave function. 

Since advocates of the first view maintain that the quantum-mechanical 
description of reality is complete, it would follow that each individual of a 
number of identically prepared maximally defined (subject to the indeterminacy
relations) systems is completely represented as to its physical state by
the same wave function. The fact that a measurement of a variable of which 
this wave function is not an eigenfunction will yield a variety of values for 
different individual systems is not, according to BOHR, to be used as an argument
for supposing that there exists a theoretically definable difference between
these systems prior to the measurement. Indeed, such a conclusion is easily 
shown not to be a necessary one: the variable which was sharp in the wave 
function and the variable finally measured correspond to mutually exclusive 
possibilities of definition experimentally; the process of measuring the latter 
includes an unobservable and uncontrollable disturbance of the value of the 
former, making its value different (7) in each case. This accounts for the difference
between systems after the measurement, without assuming a difference
before measurement. But BOHR replaces this negative conclusion (lack of
necessity for assuming a theoretically definable difference prior to measurement) 
by a positive one. The differences between different systems with regard to 
the second variable are, so to speak, created at the moment of its measurement 
and do not exist previously. Not that all had identical values, rather one must 
say that the second variable (within more or less broad limits depending on 


(6) The theorem that is basic to the present work was given in an earlier paper
by the authors, N. WIENER and A. SIEGEL: Phys. Rev., 91, 1551 (1953). We refer to
this paper here as WS.

(7) Actually this variable becomes undefined after the measurement, as was the
other before; see below.


984 N. WIENER and A. SIEGEL 


the particular wave function) has no clear-cut prior existence; not only its
values, but the second variable as a defined concept, are first created by the 
act of measuring it. The logic of the assertion that the systems are different 
after the measurement, even though identical before, is protected by the fact 
of the unobservable disturbance of the first variable; this is the essential rôle
of the indeterminacy principle. 

Without questioning the ability of these concepts to account for the pheno- 
mena, EINSTEIN has insisted that it has not been proved that physical theory 
is constrained from examining the area of indeterminacy which BOHR maintains
to be inscrutable even in concept. He asserts, in effect, that when there exists 
a difference between measured values of a variable in two systems, there exists
an in principle physically definable difference between them. It is not necessary 
to deny the undeniable disturbance sometimes accompanying the act of mea- 
surement in order to assert this; indeed, the difference can be demonstrated 
within the very context of quantum mechanics itself, even when there is no 
disturbance (7). EINSTEIN in effect challenges Bohr’s transformation, described 
above, of a negative into a positive statement. 

As we shall develop it in this paper, the differential-space theory carries 
the controversy sketched above to a new stage of concreteness. This is because 
it realizes the formal program of EINSTEIN: namely, a physical picture in 
which a collection of identically prepared (subject to the indeterminacy prin- 
ciple) systems is represented by an ensemble of different, precisely defined 
systems so that the dispersion in values of measured quantities has its counter- 
part in the variety of properties of the systems in the ensemble. On the other 
hand, the theory and current experimental information are in too provisional 
a state for these facts to constitute in any sense a refutation of Bohr’s point
of view. That is, it is entirely possible that the individual-description concept 
formally furnished by the differential-space theory may turn out to be phy- 
sically vacuous. 


2. — Individual and Ensemble. 


It is a time-honored fact that assigning (in imagination) sharp values to 
two conjugate variables of a system, and doing nothing more, will result in 
conflict with the experimental facts. In a two-slit diffraction experiment with 
electrons (8), the positional undefinability-in-principle of the electron in the
Bohr picture serves an indispensable logical purpose. If one could think of 
the electron as defined within a region of extent less than the separation of 
the slits (assuming it to be experimentally not so well defined) one would be 


(8) See L. I. SCHIFF: Quantum Mechanics (New York, 1949).

able to assert (in thought) that the electron had certainly gone through one
of the slits, and would therefore be unable to account for the influence of the 
other slit on the experimental results (the results show a diffraction pattern 
of structure dependent on the separation of the slits, since the electrons are 
by hypothesis experimentally so prepared as to give such a pattern — in part- 
icular, their experimental range of position definition is broader than the sep- 
aration of the slits). 

Referring to the first sentence of the preceding paragraph, it is then evident 
that in order to realize a description of quantum systems in which all observ- 
ables are defined one must do «something more.» The classic way to do this 
«something more » is to add further determining — « hidden » — variables to 
the system; these can then serve the all-important purpose of expressing the 
influence of the «other slit » even when the particle is hypothesized (without 
experimental verification) as not passing through it. 

A theory of hidden variables designed to explain quantum mechanics must 
contain two basic new features. The first feature, logically speaking, is concerned
with the hidden variables of a single completely described system (in
EINSTEIN's, not BOHR's sense). This we shall refer to as its dynamical part.
The other essential part of such a theory is its statistical part. This has the
following function: to show how ensembles of completely described systems
obeying the postulated dynamics can be so constructed as to have the same
statistical properties as are expressed by a given quantum-mechanical wave
function. (If nature is correctly described by such a theory, individual events
as described by the conventional quantum mechanics are crypto-deterministic (9),
i.e., deterministic in themselves, but with an imprecision in their definition
corresponding to an imprecision in their experimental measurement.)

The program stated in the preceding paragraph implies certain necessary 
conditions on its realization, tied up with the intuitive meanings our minds 
attach to the terms « deterministic » and «statistical». The following is what 
we feel to be a «fair» or «reasonable » formulation of these conditions. 


1) The individual system is deterministic, i.e., its present state (the pre- 
sent values of all measurable variables) determines its future state (the values 
of all measurable variables at any future time). The system must of course 
be closed if this is to apply. Open systems must first be combined with all 
systems with which they interact (or be subjected to some equivalent proce- 
dure), making them closed, before they can be deterministic. This condition 
concerns only what we have called the dynamical part of the theory. 

The remaining two conditions refer to the statistical part. It is assumed, 
following Gibbs, that all statistical predictions — i.e., predictions regarding the 
distribution of values of physical properties over a large number of identically, 


(9) This term is due to E. T. WHITTAKER: Proc. Roy. Soc., 59, 459 (1943).


but not (in the dynamical sense) precisely, defined systems — are to be obtained
from an ensemble of dynamically precisely defined systems, such that a measure 
(or probability, or weight) is given for any suitably defined and restricted set 
of values of the variables. This measure is assumed to be the probability, 
relative to the ensemble, of the subensemble of all systems belonging to the 
ensemble and having values of their variables in this set. Since the ensemble 
is a mathematical representation of the set of actual systems observed, and one
cannot isolate a negative number of systems, one infers the second condition.


2) The measure of a suitably defined subensemble is never negative (10).

The correspondence between the ensemble and the set of actual observed
systems is presumed to be preserved with the passage of time. That is, the
systems of the ensemble duplicate the time-behavior of the actual observed
systems. The third condition is the formal expression of this idea.


3) The measure of a subensemble corresponding to a given set of variables 
is conserved as this set changes according to the natural motion (dynamics)
of the variables contained in it (conservation of probability, or preservation 
of measure, in time; in classical statistical mechanics, the proof of this theorem 
rests on Liouville’s theorem). 


For a physicist, it appears logical to explain such a theory as ours, con- 
taining a new dynamics and a new statistical formalism, by starting with the 
dynamics. This procedure we adopt here. However, the dynamics and the 
statistics are so interwoven in the complete theory that this division is some- 
what artificial. We therefore think it proper to point out in advance that our 
discussion of the dynamics suffers from an appearance of aridity which is due 
(apart from any defects in presentation) to this separation from the statistical 
part, which largely determines its form. 


3. — The Distinction Between Operators and Observables. 


The theory of Hilbert space is a necessary part of the mathematical foun- 
dation of our theory. The observables of our theory, as in quantum mechanics, 
are associated with the set of linear Hermitian operators in Hilbert space whose
eigenstates form a complete set. In particular, the analysis of such operators


(10) Ensembles agreeing with quantum mechanics, but having negative measure for
some subsets, have been defined. See E. P. WIGNER: Phys. Rev., 40, 749 (1932); J. E.
MOYAL: Proc. Camb. Phil. Soc., 45, 99 (1949); H. WEYL: Theory of Groups and Quantum
Mechanics (London, 1931), pp. 274, 275. Such an ensemble cannot have a collection
of real physical systems as its counterpart.

given by DIRAC (11) will be useful to us as a physically-oriented presentation
of their broad features. In this connection, however, a difficulty arises of
which we must warn the reader. The closeness of the relationship between
the physical observable and its corresponding operator in quantum mechanics
has led to a quasi-identification of the two, much used by DIRAC and others,
which is useful within the context of quantum-mechanical discussions, but
would be misleading for us, because the relationship between them is different
in our theory. When a pairwise, one-to-one relationship exists between the 
members of two sets of quantities, it is often possible to use the same name 
for both members of a pair without causing any confusion, particularly when 
the relationship is a very familiar one. But when, as in the present article, 
different definitions of the relationship are being contrasted, the distinction, 
namely operator vs. observable, needs to be carefully maintained. 

We should, in fact, warn the reader that in the following text the word 
«observable » will often not mean what he is used to having it mean... For 
example, take two operators which do not commute; corresponding to this, 
one would have the statement in quantum mechanics that their corresponding 
observables do not both simultaneously « have values » for all physical states 
in which one of them «has a value». In quantum-mechanical shorthand, 
one says that the «observables do not commute». This is impressionistic 
language which underlines the parallelism between « observables not simul- 
taneously ‘‘having values” » and « operators not commuting». But in our 
theory, observables always «have values », not necessarily for the ensemble, 
but for the individual case. Hence there is no utility for us at present in 
the façon de parler « observables do not commute ».

One note further on terminology: By the phrase « quantum mechanics », 
whether used substantively, adjectivally, or adverbially, we always mean to 
refer to the standard theory whose clearest and best exposition is probably
DIRAC's book. It should be observed that from our point of view, however,
« quantum mechanics » or « quantum dynamics » are (at least formally) misnomers,
since they imply that the systems described are individuals, whereas
our position is that they (again, at least formally) really represent ensembles.
This is just the dichotomy of viewpoint discussed in § 1. The customary usage
is, nevertheless, internally consistent and we think it best to adopt it here.
The qualifier « quantum » in these expressions will prevent them from being
confused, even in our context, with their true, individual counterparts.

When we refer to « our theory », we shall call it just that, or « the differential-space
theory », or « the differential-space theory of quantum systems ».

We now proceed to describe the dynamical specification of individual systems 


(11) P. A. M. DIRAC: The Principles of Quantum Mechanics, 3rd edition (Oxford,
1947).

in our theory. For simplicity we assume systems that are such that the complete
set of independent commuting l.h.c. operators (12) required for their
observational description in ordinary quantum mechanics (13) consists of one
operator, i.e., systems of « one degree of freedom » quantum-mechanically.

This is not a real restriction. Any system can be reduced to one of a single 
degree of freedom by the following formal procedure. Consider the product 
space of the Hilbert spaces of the individual members of its complete set of 
commuting operators, and take as eigenbasis in the product space the direct
products of the eigenvectors of the individual operators. In the product space 
define an operator, diagonal in this representation, having eigenvalues that 
may be arbitrary, except that no two shall be equal. This single operator, 
considered as an observable in the quantum-mechanical sense, is a single 
degree of freedom that furnishes a representation adequate for the quantum-mechanical
description of the system.
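The reduction just described can be sketched for finite discrete spectra. This is only an illustration under stated assumptions: the two spectra below are hypothetical, the helper name `combine_commuting` is ours, and "arbitrary but distinct" eigenvalues are realized simply by enumerating the product basis.

```python
from itertools import product

def combine_commuting(spectra):
    """Assign one distinct eigenvalue to every joint eigenvector of the product basis.

    `spectra` is a list of eigenvalue lists, one per commuting operator
    (each assumed non-degenerate in its own space).
    """
    joint_labels = list(product(*spectra))
    # Any pairwise distinct real numbers would do; enumeration is the simplest choice.
    encode = {labels: float(index) for index, labels in enumerate(joint_labels)}
    decode = {value: labels for labels, value in encode.items()}
    return encode, decode

spin_z = [-0.5, 0.5]   # hypothetical spectrum of a first operator
level = [0, 1, 2]      # hypothetical spectrum of a second, commuting operator
encode, decode = combine_commuting([spin_z, level])

single_value = encode[(0.5, 2)]     # eigenvalue of the single combined operator
recovered = decode[single_value]    # the original pair of eigenvalues
```

Because no two combined eigenvalues coincide, a value of the single operator fixes the values of all the original commuting operators, which is the sense in which one degree of freedom suffices.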

Notation. In WS, we used script capitals for observables, bold-face capitals
for operators, and ordinary capitals for eigenvalues ($\mathcal{R}$, $\mathbf{R}$, and $R$ respectively).
Here we discard the use of script capitals in favor of a notation to be given in
the third paragraph of the next section. There will be no change in the
meaning of bold-face or ordinary capitals.


4. — Dynamical Specification and Dynamics in Differential-Space Theory.


From the set of all non-degenerate l.h.c. operators in the Hilbert space of
a dynamical system according to quantum mechanics, to be called N, define
a subset such that no two members of the subset commute, but any member
of N not belonging to the subset commutes with a member of it. Call this subset I.
I is in effect a maximal subset of non-commuting non-degenerate operators;
it contains one and only one non-degenerate l.h.c. operator for every orientation
of principal axes.


DYNAMICAL POSTULATE 1: The independent observables of the system are a
set of real numerical variables that can be placed in one-to-one correspondence with 
the members of I (14). 


(12) We use the term « l.h.c. operator » to denote « linear Hermitian operator having
a complete set of eigenstates », i.e., an « observable » in DIRAC's nomenclature.

(13) P. A. M. DIRAC: loc. cit., § 14. This set need not consist of constants of the
motion.

(14) This postulate is not intended dogmatically. Ultimately, modifications, refinements,
perhaps drastic reductions in the number of independent observables may
be possible. The present formulation is aimed at obtaining a dynamics which can be
brought into agreement with quantum mechanics through our statistical theory in the
easiest possible way.

Notation. The observable corresponding to R in accordance with the above
postulate will be symbolized by enclosure in brackets: [R]. When an observable
is under consideration independently of its relation to any operator, we
may use a letter symbol alone, e.g., x or f(x) (cf. next paragraph).

The set I is of course partly arbitrary. Of any subset of N consisting of 
a given non-degenerate operator and all those that commute with it, only 
one will be a member of I, but this may be any one. When a choice has to
be made in a concrete situation, the variable with the most immediate physical
meaning will probably be most convenient, e.g., x (the spatial variable) would
be preferable to any f(x) ≠ x. The choice is in any case not in principle important,
since the operator chosen is to be regarded merely as an arbitrary
representative of a maximal set of functionally dependent operators, the latter
being the important entity (15).

As for dependent variables (functions of the independent variables), we 
first treat functions that do not take on any given value at more than one 
point of the range of the independent variable. These correspond to non-degenerate
operator functions (16,17) of the operators of I. The statistical theory
to be erected on the dynamical base now being constructed allows us in this 
case to maintain functional relationships invariant: 


DYNAMICAL POSTULATE 2: An observable that is a function f([R]) of an
observable [R] that does not take on the same value for any two values of [R], corresponds
to the operator f(R).


When this postulate is written down in symbolic form, the invariance property
is very clearly displayed:

$[f(R)] = f([R]).$


The case of functions that do take on the same value for more than one 
point of the independent variable requires special treatment, because they 
are related to degenerate operators. A degenerate operator is a function of 
(i.e., commutes with) an infinite number of mutually independent non-degen- 
erate operators, one for each distinct basis in the degenerate subspace. The 
operator f(R) alone no longer serves for the correspondence; if the independent 
operators R and S both commute with f(R), f(R) is also a function, say g(S), 


(15) We might have used, instead of I, the set of distinct eigenbases of the Hilbert
space (spectral families, spectral measures, or resolutions of unity, in the rigorous
theory). We prefer to speak in terms of operators rather than eigenbases because the
operators are more directly related to the observables of our dynamics.

(16) P. A. M. DIRAC: loc. cit., §§ 11, 19.

(17) B. v. SZ. NAGY: Spektraldarstellung linearer Transformationen des Hilbertschen
Raumes (Berlin, 1942; Ann Arbor, 1947), Chapter X, § 1.

of S; but it is necessary for us (as will become clear later, see end of § 5) to
maintain a distinction between f([R]) and g([S]). If we introduce the notion
of a degenerate operator f(R) with respect to the basis given by R, i.e., f(R) is 
(arbitrarily) to be assigned this unique eigenbasis only, then g(S) becomes in 
this sense a different operator from f(R) and the one-to-one relation between 
operators and observables can be restored. 




DYNAMIC POSTULATE 2': An observable that is a function f([R]) of an observable
[R], and that does not take on different values for all points of the range of [R],
corresponds to the operator f(R) taken with respect to the basis given by R.




We now have a complete correspondence for the dynamical variables of
a system, in the sense that all l.h.c. operators have variables assigned to them,
and there are no other variables (except in the sense that we have gone somewhat
beyond the usual definition of an l.h.c. operator in the case of degenerate
operators). This set of variables is the minimum that is necessary and sufficient
for the statistical correspondence we later set up with quantum mechanics.

Our next postulate defines the possible values of an observable and takes
the simple — and familiar — form: 


DYNAMIC POSTULATE 3: The only possible values of an observable [R] are
the eigenvalues of the operator R. (This is to hold whether R belongs to I or
not, i.e., R is any l.h.c. operator). Thus the quantization of observed values
is not « explained » in our theory any more than in quantum mechanics, but
arises in the same way from a postulated correspondence with l.h.c. operators.


The postulates up to now have been concerned only with the dynamic specification
of systems, i.e., with the framework of variables to be used in the
description of the evolution of the system. The remaining discussion of this
section deals with the dynamics proper, i.e., with the time-evolution of the
system.

First we need DYNAMIC POSTULATE 4: Associated with any closed system
of given physical properties, there is a unitary operator U(t), depending on the
time t in such a way that, for arbitrary t', t'',


(1)    $U(t')\,U(t'') = U(t' + t'').$


It will not startle the reader if we prematurely reveal that U(t) is the same
as the time-displacement operator one would use for the system in quantum
mechanics. It should nonetheless be appreciated that the precedence of the
quantum-mechanical time-displacement operator is historical only, and that,
within the universe of discourse of the individual dynamics under discussion,
the rôle played by U(t) is strictly with respect to this individual dynamics
and does not require the concept of the wave function, i.e., of the quantum
ensemble considered as a whole, to give it meaning.

We are now in a position to state the basic dynamic principle. Let [R](t) 
be the value of the observable [R] at the arbitrary time t. Then we have the 
single equation that is DYNAMIC POSTULATE 5:


(2)    $[R](t) = [U^{-1}(\tau)\,R\,U(\tau)]_{\tau=t}(0).$


It would have been perfectly correct to have written simply $[U^{-1}(t)RU(t)](0)$
for the right-hand side of (2), but the somewhat awkward notation here employed
is temporarily necessary to make the meaning unmistakable.
$[U^{-1}(\tau)RU(\tau)]_{\tau=t}(\tau')$ is an observable that is itself a function of time, but its
time variable is $\tau'$; so far as this observable is concerned, t is not its time
variable but a parameter that defines it, i.e., singles it out from the other
observables $[U^{-1}(\tau)RU(\tau)](\tau')$, functions of time $\tau'$, but with values of $\tau$ other
than t.

Having established our meaning, we shall henceforth use, instead of (2),
the more succinct form

(2')    $[R](t) = [U^{-1}(t)\,R\,U(t)](0).$

Equations (2) and (2') were written with respect to a fixed origin of time.
They imply, however, equations of the same form for any choice of the time
origin. From (2') we have, with $t = t' + t_0$,


$[R](t' + t_0) = [U^{-1}(t' + t_0)\,R\,U(t' + t_0)](0)$
$\qquad\qquad = [U^{-1}(t_0)\,U^{-1}(t')\,R\,U(t')\,U(t_0)](0)$
(3)    $\qquad = [U^{-1}(t')\,R\,U(t')](t_0)$


(from equation (2'), with R replaced by $U^{-1}(t')RU(t')$ and t replaced by $t_0$).
Displacement of the time-origin to the right by an amount $t_0$ gives a new
function $[R]'(t')$ of $t' = t - t_0$, equal to $[R](t)$. Thus (3) becomes


(4)    $[R]'(t') = [U^{-1}(t')\,R\,U(t')]'(0),$


which has the same form as (2'). Equation (3), written as


(5)    $[R](t_0 + t') = [U^{-1}(t')\,R\,U(t')](t_0),$


considered for a variable $t_0$ and a constant t', reveals that the observables [R](t)
and $[U^{-1}(t')RU(t')](t)$ differ from one another only through a time displacement,
i.e., at a given instant [R] has the value $[U^{-1}(t')RU(t')]$ had at an
instant earlier by an amount t'. In this respect consider a situation in which
R and $U^{-1}(t')RU(t')$ do not commute. Then both may be chosen as distinct
members of I, and [R] and $[U^{-1}(t')RU(t')]$ are therefore, by Dynamical Postulate 1,
independent variables. The reasoning following equation (5) shows,
however, that this is true only if both are considered at the same instant of
time. [R] at a time t' later than t is dependent on, in fact equal to,
$[U^{-1}(t')RU(t')]$ at t. All this, however, does not invalidate but only makes
precise the independence of the observables $[U^{-1}(t)RU(t)](t_0)$ with various
values of t. Holding $t_0$ fixed, and assuming the past and future (relative to $t_0$)
absolutely unspecified, the assignment of arbitrary (within our previously
defined limitations), mutually independent values to the set of observables
obtained by letting t take on all values from $-\infty$ to $+\infty$, results in a determined
past and future for these variables; but this agrees with any intuitive
concept of a complete set of independent observables for a sharply defined
dynamic system. (Note, however, that any observable whose corresponding
operator is not equal to $U^{-1}(t)RU(t)$ for any value of t is undetermined by
the past values of all these operators $U^{-1}(t)RU(t)$; thus we see that the values
of all these operators for a given instant do not determine all the variables
of the system; observables of a system will fall into disjoint sets such that
observables are dynamically related if and only if they belong to the same set.)
There is a similarity between the dynamical principle here stated and classical
dynamics in one of its formulations. It is well known (18) that in classical
systems there exists a certain contact transformation, a continuous function
of a parameter (say, λ), of the present variables of a system, such that the
transformed variable for a given value of λ at zero time is equal to the value
of the untransformed variable at time t = λ. The parallel with our dynamics
is obvious. There is, however, a difference. In the classical theory, one has
a certain set of independent variables as untransformed variables, and the
transformation involves rather general functional relationships between these
and the transformed variables. In our theory, one has again a set of independent
variables, but the transformation is of a simpler type, the transformed
set being as a set identical with the untransformed: that is, the transformation
is just a re-labelling of the original variables.

It is readily apparent that the Dynamical Postulates above stated define 
a system that satisfies condition 1 of § 2. 

We note one important property of the dynamics here defined: Observables
whose operators commute with the Hamiltonian, and therefore with


(18) H. GOLDSTEIN: Classical Mechanics (Cambridge, Mass., 1950), § 8-6.

U(t), are constants of the motion of our individual systems, since if R commutes
with U(t), $U^{-1}(t)RU(t) = R$ and eq. (2') gives


$[R](t) = [R](0).$


Thus observables that are « constants of the motion » in the somewhat special 
sense of quantum mechanics are quite literally constants of the motion in our 
theory. 
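The operator statement used here, that U^{-1}(t)RU(t) = R whenever R commutes with U(t), can be checked numerically on a two-level toy model. The Hamiltonian H = OMEGA·σ_x, the choice of σ_x and σ_z as test operators, and all helper names are assumptions of this sketch, not part of the theory.

```python
import math

def matmul(a, b):
    """Product of two 2x2 complex matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Hermitian conjugate of a 2x2 complex matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def close(a, b, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))

OMEGA = 0.7  # hypothetical frequency of the assumed Hamiltonian H = OMEGA * sigma_x
SIGMA_X = [[0j, 1 + 0j], [1 + 0j, 0j]]    # commutes with H
SIGMA_Z = [[1 + 0j, 0j], [0j, -1 + 0j]]   # does not commute with H

def U(t):
    """exp(-i H t) for H = OMEGA * sigma_x, in closed form."""
    c, s = math.cos(OMEGA * t), math.sin(OMEGA * t)
    return [[complex(c, 0.0), complex(0.0, -s)],
            [complex(0.0, -s), complex(c, 0.0)]]

def evolved_operator(R, t):
    """U^{-1}(t) R U(t); for unitary U the inverse is the Hermitian conjugate."""
    return matmul(matmul(dagger(U(t)), R), U(t))

t = 0.9
commuting_is_constant = close(evolved_operator(SIGMA_X, t), SIGMA_X)
noncommuting_is_constant = close(evolved_operator(SIGMA_Z, t), SIGMA_Z)
```

In this toy model the operator that commutes with H is left unchanged, so by eq. (2') its observable keeps its initial value, while the non-commuting operator is carried into a different member of the set.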


5. — Summary of Properties of Differential Space; Restatement of the Basic 
Theorem of Differential-Space Quantum Theory. 


In this section we set the stage for the presentation of the statistical part 
of our theory. There will be a brief, selective summary of the material in WS 
that is relevant to the present purpose, and an improved proof of the theorem 
in that paper, the « polychotomic theorem », that is basic here. We shall also,
later in the section, find it appropriate to introduce an important expository 
simplification concerning operators having a continuous spectrum. Finally, 
we supplement the work of WS by showing how the polychotomic theorem is 
to be applied in the case of a degenerate operator. 

First, we briefly characterize differential space (19) in the form it takes
with respect to a discrete basis: It is an extended Hilbert space, (1) including
all un-normalizable vectors as well as the normalizable vectors to which Hilbert
space is restricted; and (2) in it is defined a measure associated with a given
volume differential as follows (we assume at first a discrete spectrum, and
discuss continuous spectrum operators later). Given a non-degenerate l.h.c.
operator R and its associated eigenbasis consisting of the set of orthonormal
eigenkets $|R_i\rangle$, each being labelled by its associated eigenvalue $R_i$ (20). A


(19) N. WIENER: Journ. Math. Phys., 2, 131-174 (1923). An extended heuristic account
of differential space is contained in WS, and here will also be found further references
on the subject.

(20) The introduction of the l.h.c. operator R here is not strictly necessary for the
immediate argument. The basis is entirely arbitrary, since an operator can always
be constructed whose eigenkets are exactly the basic kets of a previously assigned
basis. The operator R is introduced only in order to unify the present stage of the
argument with later work.

point P (21) in differential space is characterized by a vector whose components
with respect to these axes are, say, $\xi_i(P)$. More precisely: If the vector to
P has the ket $|P\rangle$, $\xi_i(P)$ is the component of the bra $\langle P|$ with respect to the
eigenaxis belonging to $R_i$, i.e.,


(5a)    $\xi_i(P) = \langle P|R_i\rangle.$


This definition insures agreement with the formal work of WS. $\xi_i$ is complex;
separating real and imaginary parts,


(6)    $\xi_i = \xi_{i1} + i\,\xi_{i2}.$


A weight or probability (2?) is assigned to the set of points P contained in a 
small volume element 


(7)    $\prod_i d\xi_{i1}\,d\xi_{i2}$


at the point P, equal to


(8)    $\prod_i (2\pi)^{-1} \exp[-|\xi_i|^2/2]\,d\xi_{i1}\,d\xi_{i2} = \prod_i (2\pi)^{-1} \exp[-x_i]\,d\theta_i\,dx_i;$


$\theta_i$ and $x_i$ are the phase and half the squared modulus of $\xi_i$, respectively:


(9)    $\xi_i = \sqrt{2x_i}\,\exp[i\theta_i].$


The increments $d\xi_{i1}$ and $d\xi_{i2}$ are always taken positive, i.e., the element of
probability is necessarily non-negative.
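The equality of the two forms of the weight in eq. (8) follows from the change of variables (9). As a short check for a single index i (the auxiliary radius $r_i$ is a symbol introduced here only for the intermediate step):

```latex
\begin{aligned}
\xi_i &= \xi_{i1} + i\,\xi_{i2} = \sqrt{2x_i}\,e^{i\theta_i},
  \qquad r_i \equiv |\xi_i| = \sqrt{2x_i},\\
d\xi_{i1}\,d\xi_{i2} &= r_i\,dr_i\,d\theta_i = dx_i\,d\theta_i
  \qquad \bigl(x_i = r_i^2/2 \;\Rightarrow\; dx_i = r_i\,dr_i\bigr),\\
(2\pi)^{-1} e^{-|\xi_i|^2/2}\,d\xi_{i1}\,d\xi_{i2}
  &= (2\pi)^{-1} e^{-x_i}\,d\theta_i\,dx_i,\\
\int_0^{2\pi}\!\!\int_0^{\infty} (2\pi)^{-1} e^{-x_i}\,dx_i\,d\theta_i &= 1 .
\end{aligned}
```

The last line confirms that each factor is normalized: $\theta_i$ is uniform on $(0, 2\pi)$ and $x_i$ has density $e^{-x_i}$ on $(0, \infty)$.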

Two properties of the above defined weight are basic for the theory of 
quantum systems. 


(21) In WS and in WIENER's work on differential space (v. foot-note (19)) the « parameter
of distribution » α identifies differential space points and also acts as the variable
of a space in which the weight of eq. (8) is expressed as a direct measure of sets of
points. P is, with some extra emphasis on the vector interpretation, essentially equivalent
to α in its first function, i.e., relieved of the burden of bearing the weight expressed
by eq. (8).

(22) The mean square of any $\xi_{i1}$ or $\xi_{i2}$ in the distribution (8) is unity. This makes
the mean over P of $|\xi_i(P)|^2$, or the absolute square of the projection, on to any normalized
vector, of the vector to P, equal to 2. We take this occasion to point out an
error in § 4 of WS. If eq. (19) there is accepted, $\int |\psi(\alpha)|^2\,d\alpha = 2$, not 1, and corresponding
changes need to be made in eq. (22), the equation following (23), and
eq. (24). When $\int |\psi(\alpha)|^2\,d\alpha \neq 1$ the transformation from x to α (WS, end of § 2) is
not strictly unitary. It can, of course, be made so by making the mean squares of
$|\xi_{i1}|$ and $|\xi_{i2}|$ equal to 1/2 each.

(1) Unitary invariance. If all the points P of a set S are transformed by a
unitary operator (realized by a unitary matrix acting on the $\xi_i$) into points
P' defining a set S', the weight of S' equals that of S. This may be understood
as a hyperspherical symmetry of the weight function in differential space.

(2) Statistical independence of the $\xi_i$. The weight is a product of factors
each containing only one $\xi_i$. This, combined with unitary invariance, means
that the components of a vector in differential space with respect to any two 
orthogonal unit vectors are distributed independently of one another. This 
property is especially important for the continuous spectrum. The Gaussian 
form of distribution used here is the only form possessing both of these pro- 
perties, and having finite mean squares for all variables. 
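
The two properties can be checked numerically on a small example. The following is a minimal sketch, assuming that a point P is sampled by drawing the real and imaginary parts of each $\xi_i$ as independent unit-variance Gaussians, as the weight (8) prescribes; the function names, the particular 2x2 unitary matrix, and the sample size are illustrative choices, not part of the original text.

```python
import cmath
import math
import random


def sample_point(n, rng):
    """Draw n components xi_i of a point P from the weight (8):
    real and imaginary parts are independent unit-variance Gaussians."""
    return [complex(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(n)]


def rotate_pair(xi, angle, phase):
    """Apply an (arbitrarily chosen) 2x2 unitary matrix to a pair of components."""
    c, s = math.cos(angle), math.sin(angle)
    e = cmath.exp(1j * phase)
    a, b = xi
    return [c * a + s * e * b, -s * e.conjugate() * a + c * b]


def summarize(n_samples=20000, seed=0):
    """Return (mean |xi_1|^2 before, mean |xi_1'|^2 after, |mean xi_1' conj(xi_2')|)."""
    rng = random.Random(seed)
    sq_before = sq_after = 0.0
    cross = 0j
    for _ in range(n_samples):
        xi = sample_point(2, rng)
        eta = rotate_pair(xi, angle=0.7, phase=1.3)
        sq_before += abs(xi[0]) ** 2
        sq_after += abs(eta[0]) ** 2
        cross += eta[0] * eta[1].conjugate()
    return sq_before / n_samples, sq_after / n_samples, abs(cross) / n_samples


if __name__ == "__main__":
    print(summarize())
```

Within sampling error, the mean of $|\xi_i|^2$ stays at 2 after the unitary transformation and the transformed components remain uncorrelated, which is what properties (1) and (2) lead one to expect. Vanishing correlation is only a necessary condition for independence; the sketch does not test independence in full.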

The basic theorem that will be used in constructing the statistical part of
the present work is that of the polychotomic method described in § 5 of WS,
which we summarize here with a shortened version of the proof given there
in eqs. (35)-(38). Let there be given a normalized quantum-mechanical state
vector $|\psi\rangle$ whose representative in the R basis is


(10)  $a_i = \langle R_i|\psi\rangle$.


We now define a certain correspondence depending on $\psi$ between eigenvalues
$R_i$ and points P, i.e., we define a functional $R(P, \psi)$ which may take on only
the values $R_i$. (Although the relation between P and $\psi$ in this work is not
without a certain symmetry, in a simple interpretation $\psi$ is to be considered
as fixed and P as variable.) $R(P, \psi)$ is to be, by definition, that eigenvalue
$R_k$ for which the ratio of the magnitudes of the associated quantities $\xi_k(P)$
and $a_k$ is a minimum, i.e.,


(11)  $R(P, \psi) = R_k$


if and only if


(12)  $\frac{|\xi_k(P)|}{|a_k|} \le \frac{|\xi_i(P)|}{|a_i|}$  (all i).


We shall refer to this method of constructing $R(P, \psi)$ as the polychotomic
algorithm.
We now show that for any given k the weight of the region satisfying (12),
obtained as the integral of the elementary weight expression (8) over this
region, is just $|a_k|^2$. In symbols:


(13)  Prob. {P satisfies (12)} $= \int\!\!\int \prod_i (2\pi)^{-1} \exp[-x_i]\, d\theta_i\, dx_i = |a_k|^2$,

the integral being extended over the region satisfying (12). (We shall call
this the polychotomic theorem.) Eq. (13) is most easily proved if there are only
a finite number, say n, of non-vanishing $a_i$; one can then apply the result to
any wave function by going to the limit $n \to \infty$ if necessary. Imagine, for
ease in writing, the $a_i$ renumbered so that $a_k$ becomes $a_1$ and the remaining
n-1 non-vanishing $a_i$ receive subscripts from 2 to n inclusive. Then (12)
reads (changing to the $x_i$ as variables)


(12')  $\frac{x_1}{|a_1|^2} \le \frac{x_i}{|a_i|^2}$  (all i).


All $\xi_i$ are to be treated as independent variables of integration subject only
to the inequalities (12'). These imply no restriction on the $\theta_i$, so the angle
integrations (from 0 to $2\pi$) of (13) may be done at once, yielding


(14)  $\int \prod_i \exp[-x_i]\, dx_i$.


The inequalities (12') imply limits of integration on the individual $x_i$
independent of one another, as follows:


(15a)  $\infty > x_i \ge \frac{|a_i|^2}{|a_1|^2}\, x_1$  (i = 2, 3, ..., n),
(15b)  $\infty > x_1 > 0$,
(15c)  $\infty > x_i > 0$  (i > n).


For i > n, each $x_i$ may be integrated independently, giving a factor 1. We
remain with


(16)  $\int_0^\infty \exp[-x_1]\, dx_1 \int_{(|a_2|^2/|a_1|^2)x_1}^\infty \exp[-x_2]\, dx_2 \cdots \int_{(|a_n|^2/|a_1|^2)x_1}^\infty \exp[-x_n]\, dx_n = \int_0^\infty \exp[-x_1/|a_1|^2]\, dx_1 = |a_1|^2$


(remembering that $\sum_i |a_i|^2 = 1$ due to normalization). This (since $|a_1|^2$ is
equal to what in the original numbering was $|a_k|^2$) proves the theorem.

The above proof of the polychotomic theorem is equivalent to the work 
of eqs. (35) to (38) of WS. 
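
The theorem lends itself to a direct Monte Carlo check. The sketch below assumes, following eq. (14), that after the angle integrations each $x_i$ is an independent variable with density $\exp[-x_i]$; it applies inequality (12') to each sampled point and compares the observed frequencies with the $|a_k|^2$. The function names, the example amplitudes, and the sample size are illustrative assumptions.

```python
import random


def polychotomic_index(x, a_sq):
    """Index k that minimises x_k / |a_k|^2, i.e. the k selected by inequality (12')."""
    return min(range(len(a_sq)), key=lambda i: x[i] / a_sq[i])


def estimate_weights(a_sq, n_samples=50000, seed=1):
    """Fraction of sampled points P assigned to each eigenvalue index.

    a_sq holds the non-vanishing |a_i|^2 of a normalized state (they sum to 1).
    """
    rng = random.Random(seed)
    counts = [0] * len(a_sq)
    for _ in range(n_samples):
        # Each x_i = |xi_i|^2 / 2 has density exp[-x_i] under the weight (8).
        x = [rng.expovariate(1.0) for _ in a_sq]
        counts[polychotomic_index(x, a_sq)] += 1
    return [c / n_samples for c in counts]


if __name__ == "__main__":
    a_sq = [0.5, 0.3, 0.2]
    for k, (f, p) in enumerate(zip(estimate_weights(a_sq), a_sq)):
        print(f"k={k}: frequency {f:.3f}, |a_k|^2 = {p:.3f}")
```

Components with vanishing $a_i$ are left out of the example, as in the proof, since the corresponding ratio in (12) is never the minimum.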

If part or all of the spectrum of R is continuous, then in a formal sense
one thinks of the discrete index i of $\xi_i(P)$ as replaced by the continuous
variable R to give, say, $\xi(R; P)$. But this transition cannot be so simply carried
out in the distribution (8). $\xi(R; P)$ is a component of the vector to P only
in an improper sense. The distribution (8) is defined only for resolutions of
the vector to P with respect to a discrete basis. Various devices may be
employed to adapt the polychotomic algorithm to the continuous case, but all
have the property of introducing discrete bases. One is given in eqs. (40)
and (41), plus the material in between, of WS; another, having the advantage
of making the base vectors that are introduced independent of the wave
function, would be to use eigendifferentials.

Although the inclusion of the continuous spectrum does not introduce any 
difficulties into the polychotomic algorithm, it does introduce complications 
of a purely formal character into the theory being presented here. For the 
purposes of the present discussion, therefore, we consider it best to adopt the 
tentative fiction that all operators have discrete spectra only. This will allow 
the essentially physical line of argument here being presented to emerge with 
much greater clarity. The physical picture is not different for the continuous 
spectrum. Thus, having presented it here in unified fashion, it will be easy 
in later work to show how the continuous spectrum is incorporated into it. 

The statement of the polychotomic algorithm given above, and the work
of WS, are adequate only if the operator R is non-degenerate. Consider now
a degenerate operator, which we call D for the sake of distinguishing it from
non-degenerate operators. D's eigenkets $|D_i\rangle$ are not unique, and this makes
the value of $D(P, \psi)$ non-unique over a set of non-zero measure (this assertion
is proved in the Appendix). Some way of fixing the eigenkets of D is needed
in order to make $D(P, \psi)$ unique, or, rather, to give unique labels to the
infinite number of different functionals $D(P, \psi)$. In order to do this in a
physical way, we note that each member of the infinite set of different complete
orthogonal bases of D is the unique eigenbasis of some l.h.c. operator belonging
to the set J (§ 4). Select any pair, say R and S, of such members of J whose
unique eigenbases are eigenbases of D. Then R and S commute with D but
not with one another, and one has D a function of either:


(17)  $D = f(R) = g(S)$.


If the eigenbasis of R is used in applying the polychotomic algorithm to D,
one obtains uniquely $D(P, \psi) = f(R(P, \psi))$; if that of S is used, one obtains,
again uniquely, $D(P, \psi) = g(S(P, \psi))$. Thus a unique selection of a functional
$D(P, \psi)$ is obtained by specifying along with D a non-degenerate operator
with which it commutes. We can absorb the degenerate case into the
terminology of the non-degenerate case if we agree to consider f(R) and g(S),
whenever explicitly so written, as having distinct eigenbases, viz.: f(R)'s
eigenbasis is that of R, and g(S)'s is that of S. The inexplicit notation D is
rejected, under this agreement, as not adapted for the purpose of the
differential-space theory. An operator is defined by its eigenbasis, not merely
by its effect on a vector.

In the next section we explicitly introduce the notion of $R(P, \psi)$'s (R any
operator) being a value of an observable belonging to a system associated
with the point P. The preceding paragraph is then seen to give the statistical
background of Dynamical Postulate 2' and the paragraph preceding it, § 4.


6. — Statistical Part of the Theory.


Let there be given a system whose observables in quantum mechanics are such
as to correspond to the set of all l.h.c. operators in a certain Hilbert space.
Then, since R in the polychotomic algorithm is any l.h.c. operator, we can assign
a value of each and every observable (via its associated l.h.c. operator) (23)
to any point P in differential space, except for a set of measure zero. We take
the set of all points of differential space to be the mathematical representation
of an ensemble (24) of systems, each point (25) corresponding to a system the
values of whose observables are those associated with the point. For the
measure of any suitably chosen (26) subensemble of the ensemble we take the
differential-space measure (8), integrated over the associated subset of
differential space. We thus have an ensemble of systems that are sharply
defined in Einstein's sense, namely that of the simultaneous assignment to the
system of precise values of all physical observables, with a positive measure
for all subensembles, and such that every statistical property derivable from
the wave function by quantum mechanics is verified for the ensemble (the
ensemble is, of course, richer in statistical properties than the
quantum-mechanical algorithm). Hence we have, so far as condition 2 of § 2 is
concerned, carried out our program of a representation of quantum-mechanical
statistical properties through ensembles of sharply-defined systems. This
means, essentially, that so far we can establish a suitable ensemble at any
instant. We have yet to satisfy conditions 1 and 3, which have to do with
the temporal changes of individual systems and ensembles respectively.


(23) Strictly speaking, for the continuous spectrum we can assign only an arbitrarily
small interval of eigenvalues, not an individual value. This «hedge» illustrates the
trivial logical difficulties (mathematically, of course, they are not trivial) introduced
into the exposition by the continuous spectrum, and helps to show why we chose to
omit it for the present.

(24) We use «ensemble» in Gibbs's sense, i.e., it is a collection of objects used to
form a statistical representation of a physical situation in which one's prior knowledge
is less than maximal. The word «set» is used in the mathematical sense, e.g., «the
theory of sets.» Any ensemble is, of course, also a set, but we prefer to be more
specific when we actually refer to an ensemble. We use the word «ensemble» as if it
were English; the fact that «ensemble» means «set» in French does not make it mean
«set» when it is used as an English word.

(25) This is of course somewhat loosely stated. The points of differential space are
a continuum, and cannot be put in one-to-one correspondence with any set of discrete
objects. This correspondence is more rigorously obtained, as in the phase space of
classical statistical mechanics, by associating small regions of finite measure, which
can be shrunk down to infinitesimal size in the limit, with systems.

(26) The corresponding subset of differential space must be Borel measurable. This
is not an onerous requirement, physically speaking.

Condition 3 is now readily satisfied by having all representative points move 
in differential space according to some unitary transformation. This alone, 
according to the unitary invariance property of the weighting (8), is enough 
for the purpose. Thus we still have a free choice as to the particular unitary 
transformation we wish to use. 

The most natural choice for this unitary transformation is the Schrödinger
time-displacement operator U(t). That is, the ket at t for a system whose ket
at 0 was $|P\rangle$, is $U(t)|P\rangle$. (We shall also say that P is carried into UP, even
though the transformations of P and of $|P\rangle$ are formally different sorts of
things.) In effect, $|P\rangle$ satisfies the same Schrödinger equation as the state
vector $|\psi\rangle$ (27).

The most important consequence of this choice is the simple deterministic 
form it gives for the individual dynamics of systems. That is, condition 1 
of § 2 is satisfied. To prove this we need the following lemma: 


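(18)  $R(WP, W\psi) = (W^{-1}RW)(P, \psi)$,  (28)

if W is a unitary transformation. Equation (18) is proved as follows: Let

(19)  $R(WP, W\psi) = R_l$.


This is equivalent to


(20)  $\frac{|\langle R_l|WP\rangle|}{|\langle R_l|W\psi\rangle|} \le \frac{|\langle R_m|WP\rangle|}{|\langle R_m|W\psi\rangle|}$,  all m.


But since $|WP\rangle = W|P\rangle$, $|W\psi\rangle = W|\psi\rangle$, we have

(21)  $\frac{|\langle R_l|W|P\rangle|}{|\langle R_l|W|\psi\rangle|} \le \frac{|\langle R_m|W|P\rangle|}{|\langle R_m|W|\psi\rangle|}$,  all m.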


(27) Since the space representative of $|P\rangle$ is, typically, not a smooth function, there
is no question of its satisfying the Schrödinger equation in its differential form. Either
a) the integral form must be used, b) an expansion in energy eigenfunctions is carried
out, with each individual eigenfunction propagating itself in time according to the
factor $\exp[-iHt/\hbar]$, or c) the function is considered as the limit of a sequence of
functions that do satisfy the Schrödinger equation, and its motion in differential space
is then obtained as that of the limit function of this sequence. All three methods, of
course, give the same result.

(28) The right-hand side is to be read as if it were $f(P, \psi)$, with $f = W^{-1}RW$.

Now


(22)  $\langle R_i|W = \langle (W^{-1}RW)_i|$,


provided we number alike equal eigenvalues of R and of $W^{-1}RW$. Thus (21)
may be written


(23)  $\frac{|\langle (W^{-1}RW)_l|P\rangle|}{|\langle (W^{-1}RW)_l|\psi\rangle|} \le \frac{|\langle (W^{-1}RW)_m|P\rangle|}{|\langle (W^{-1}RW)_m|\psi\rangle|}$,  all m,


which is equivalent to

(24)  $(W^{-1}RW)(P, \psi) = R_l$.


Comparing this with (19), one obtains equation (18).
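
The lemma $R(WP, W\psi) = (W^{-1}RW)(P, \psi)$ can also be checked numerically in a finite-dimensional space. The following is a minimal sketch under stated assumptions: the eigenkets of R are taken as the standard basis vectors, W is a random unitary matrix built by Gram-Schmidt orthonormalization, and P and $\psi$ are random complex vectors ($\psi$ is left unnormalized, which does not change which ratio is smallest). All function names are hypothetical.

```python
import math
import random


def inner(u, v):
    """<u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def matvec(m, v):
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def dagger(m):
    n = len(m)
    return [[m[j][i].conjugate() for j in range(n)] for i in range(n)]


def random_vector(n, rng):
    return [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]


def random_unitary(n, rng):
    """Unitary matrix whose rows are Gram-Schmidt-orthonormalized random vectors."""
    rows = []
    for _ in range(n):
        v = random_vector(n, rng)
        for r in rows:
            c = inner(r, v)
            v = [vi - c * ri for vi, ri in zip(v, r)]
        norm = math.sqrt(sum(abs(vi) ** 2 for vi in v))
        rows.append([vi / norm for vi in v])
    return rows


def polychotomic(basis, p, psi):
    """Index k minimising |<b_k|P>| / |<b_k|psi>| over the given eigenbasis."""
    return min(range(len(basis)),
               key=lambda k: abs(inner(basis[k], p)) / abs(inner(basis[k], psi)))


def check_lemma(n=4, trials=200, seed=2):
    rng = random.Random(seed)
    r_basis = [[1 + 0j if i == k else 0j for i in range(n)] for k in range(n)]
    for _ in range(trials):
        w = random_unitary(n, rng)
        p, psi = random_vector(n, rng), random_vector(n, rng)
        # Left side of (18): R's eigenbasis applied to the transformed pair.
        left = polychotomic(r_basis, matvec(w, p), matvec(w, psi))
        # Right side: eigenkets of W^{-1} R W are W^{-1}|R_k>, numbered like R's.
        rotated = [matvec(dagger(w), b) for b in r_basis]
        right = polychotomic(rotated, p, psi)
        if left != right:
            return False
    return True


if __name__ == "__main__":
    print(check_lemma())
```

The agreement is exact rather than statistical, because both sides compare the same numbers $|\langle R_m|W|P\rangle| / |\langle R_m|W|\psi\rangle|$, which is the content of eqs. (21) to (23).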

Eq. (18) may be «proved» geometrically as follows:

The two orthogonal axes in Fig. 1 represent the entire infinite-dimensional
set of axes; the pair $|R_i\rangle$, $|R_j\rangle$ is singled out and labelled. The unitary
transformation W is represented as a rotation. The vectors $|P\rangle$ and $|\psi\rangle$ and
the rotated vectors $W|P\rangle$ and $W|\psi\rangle$ are shown. It is apparent that $|P\rangle$, $|\psi\rangle$
are to the inversely rotated axes $W^{-1}|R_i\rangle = |(W^{-1}RW)_i\rangle$ exactly as
$W|P\rangle$, $W|\psi\rangle$ are to the axes $|R_i\rangle$. This gives (18) immediately.

Fig. 1. — Schematic proof of Eq. (18).

We can now show easily that the time evolution of systems in the ensemble is
given by exactly the dynamics of Dynamic Postulate 5, § 4. The definition
given of the motion of the system point with time means that [R](t) for the
system represented by P at t = 0 satisfies


(25)  $[R](t) = R(U(t)P, U(t)\psi)$

for all t. But eq. (18), with W = U(t), gives then

(26)  $[R](t) = (U^{-1}(t)RU(t))(P, \psi)$.

If (25) is written down for an arbitrary operator S,


(27)  $[S](t) = S(U(t)P, U(t)\psi)$,


then putting t = 0 gives

(28)  $[S](0) = S(P, \psi)$,


hence the right-hand side of (26) is equal to $[U^{-1}(t)RU(t)](0)$ for the same
system, and we have


(29)  $[R](t) = [U^{-1}(t)RU(t)](0)$,


the same as eq. (2'), i.e., Dynamic Postulate 5.

The fact that (29) applies to the systems of our ensemble means that the
specification for a system, at a given time, of the values of the observables
$[U^{-1}(\tau)RU(\tau)]$ for all $\tau > 0$ suffices to determine its future, and therefore
condition 1 of § 2 is satisfied. In particular, a knowledge of $\psi$ is unnecessary in
predicting the evolution of a system if these observables are known, as it should
be if the system is a single deterministic system.

Since we have now satisfied conditions 1, 2 and 3 of § 2, we have de- 
monstrated that quantum mechanics is equivalent to a theory of crypto- 
deterministic processes, if these conditions are considered acceptable criteria. 
In this connection one objection might be anticipated: One might argue that 
a description containing an infinite number of parameters is not deterministic, 
operationally speaking, since it is not experimentally realizable in a finite 
time. Against this argument we would assert that it would also make classical 
mechanics operationally indeterministic, since perfect precision in even only
one measurement would require an infinite amount of time; and it would 
certainly make classical wave theory indeterministic, since the specification of 
a field requires its measurement at all points of a continuum. 

It will be noted that, in satisfying condition 1 of § 2, we have appealed
only to Dynamic Postulate 5 of § 4. This is the only one of the postulates
necessary in satisfying this condition. We should therefore show why the other
postulates are given at all. Generally speaking, they are required for
completeness in the description of the individual-particle dynamics as such. More
specifically: In § 4, a correspondence is set up between Q and the observable [Q];
the polychotomic algorithm sets up a correspondence between Q and the
functional $Q(P, \psi)$. We have a crypto-deterministic theory of quantum systems
once we set up a correspondence between the observable [Q] and the functional
(until now of merely mathematical status) $Q(P, \psi)$. But the polychotomic
algorithm, and the discussion in the next-to-last paragraph of § 5, also yield
statements about the correspondence $Q \to Q(P, \psi)$ when Q is regarded as a
function of an operator other than Q. Thus the correspondence $Q \to [Q]$
may not be set up arbitrarily in this case, but must rather be defined so that
[Q] has the correct relationship to the observable corresponding to the operator
of which Q is a function. This is the purpose of postulates 2 and 2'.

The necessity of postulates 3 and 4, in adapting the dynamics to the eventual 
statistical theory, is, we think, self-evident. 

Postulate 1 is fundamental, in that it defines the correspondence $Q \to [Q]$,
and this is its primary raison d'être. That the statistical theory does indeed
allow the observables associated with the members of J to be independent
(i.e., that it generates no functional relationship among them) will not be
proved here, but left for a later paper.

At this point we mention again that, as pointed out at the end of § 5, by 
satisfying eq. (2'), the dynamics of the individual systems of our ensemble 
makes observables that are « constants of the motion » in quantum mechanics, 
literally constant for these individual systems. Thus all the conservation laws 
of a quantum mechanical system, and also of the classical system having the 
same formal Hamiltonian, are satisfied by them. 

A later article will deal with possible physical interpretations of the theory,
with its relationship to other hidden-parameter theories (especially that of
BOHM), with possible variations in the formulation, and other matters.


We wish to thank Dr. SAUL T. EPSTEIN for many interesting discussions,
which contributed to a more clean-cut understanding of the physical
interpretation of the quantum-mechanical wave function.


APPENDIX 


Proof that the Result of the Polychotomic Algorithm is Non-Unique When
Applied to a Degenerate Operator.


We here show (see § 5) that, when D is degenerate, the value of $D(P, \psi)$
is non-unique over a set of non-zero measure if only the unaltered polychotomic
algorithm is used.

Suppose, at some point P, $D(P, \psi) = D_k$ with $D_k$ degenerate. Then (see
eq. (14)), $\xi_k(P)$, the absolute magnitude of whose ratio to $a_k$ is the smallest
of all such ratios at P, is the component of $|P\rangle$ with respect to only one of
a degenerate set of, say, $\nu$ eigenkets $|D_{k1}\rangle$, $|D_{k2}\rangle$, ..., $|D_{k\nu}\rangle$, all having
eigenvalue $D_k$, and should be distinguished from the others; let us do so by
calling it number 1 of the sequence, $|D_{k1}\rangle$. Now a unitary transformation
that affects only this degenerate set of eigenkets changes them into a new set
of eigenkets, say $|D_{k1}'\rangle$, $|D_{k2}'\rangle$, ..., $|D_{k\nu}'\rangle$ ($\nu' = \nu$). Let the original $\xi$'s be
denoted by $\xi_{k1}$, $\xi_{k2}$, ..., $\xi_{k\nu}$ and the original a's by $a_{k1}$, $a_{k2}$, ..., $a_{k\nu}$; the
transformed $\xi$'s and a's, by the same symbols primed. Now it is certainly possible
for the P under consideration to be a point such that of the new ratios in
the degenerate space $|\xi'_{k\kappa}|/|a'_{k\kappa}|$ ($\kappa = 1, 2, ..., \nu$) none is a minimum in the
entire Hilbert space even though $|\xi_{k1}|/|a_{k1}|$ was such a minimum. We set
up an extreme situation for the sake of a more striking illustration of this
possibility: choose P so that some $|\xi_i|/|a_i|$ for which $D_i \ne D_k$, say $|\xi_l|/|a_l|$,
is just barely greater than the minimum $|\xi_{k1}|/|a_{k1}|$.

Let $\nu = 2$ for simplicity. Then there is only one other eigenket with
$D = D_k$, namely $|D_{k2}\rangle$. Choose $|\xi_{k2}|$, which is still at our disposal, so that
$|\xi_{k2}|/|a_{k2}| > |\xi_{k1}|/|a_{k1}|$ and $|\xi_{k2}| \gg |\xi_{k1}|$. Take a transformation of the
subspace of $|D_{k1}\rangle$ and $|D_{k2}\rangle$ that mixes them in roughly equal amounts. Then,
due to the second of the preceding inequalities, $|\xi'_{k1}|$ and $|\xi'_{k2}|$ will each be
roughly equal to $\tfrac{1}{2}|\xi_{k2}|$, which is much greater than $|\xi_{k1}|$. This fact, with the
rest of the two preceding inequalities, shows that $|\xi'_{k1}|/|a'_{k1}| > |\xi_{k1}|/|a_{k1}|$
and $|\xi'_{k2}|/|a'_{k2}| > |\xi_{k1}|/|a_{k1}|$. Thus, since $|\xi'_l|/|a'_l| = |\xi_l|/|a_l|$ (the
transformation affects only eigenkets for $D = D_k$), and $|\xi_l|/|a_l|$ is almost as small as
$|\xi_{k1}|/|a_{k1}|$, $|\xi'_l|/|a'_l|$ is the smallest of all the ratios after the transformation.

To see that the set in differential space for which the above can hold is
of non-zero measure, one need only observe that non-zero variations of all
the $|\xi_i|$ about the particular point P are possible without affecting the situation.
In fact, the range of variation simply increases with the strength of the
inequalities of the preceding paragraph.

Obviously the reverse of the above example is possible, i.e., there are sets
of non-zero measure for which $D(P, \psi)$ changes from some arbitrary value
into a degenerate eigenvalue. The non-uniqueness can be generally
characterized as follows: the transformations that interchange complete orthonormal
sets of eigenkets of a degenerate l.h.c. operator will (except for the identity
transformation) transform the functional $D(P, \psi)$ over a set of P of non-zero
measure. At a given point P, at least one of the two values of D that are
interchanged must be a degenerate eigenvalue.
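
The non-uniqueness can be illustrated with a small simulation. The sketch below assumes a three-dimensional space in which D has a doubly degenerate eigenvalue (labelled "D_k") and one non-degenerate eigenvalue (labelled "D_l"); the polychotomic algorithm is applied once with an arbitrarily chosen eigenbasis and once after mixing the two degenerate eigenkets in equal amounts. The state vector, the labels, and the function names are illustrative assumptions.

```python
import math
import random


def inner(u, v):
    """<u|v>, antilinear in the first argument."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def d_value(basis, labels, p, psi):
    """Eigenvalue label picked by the polychotomic algorithm in the given eigenbasis."""
    k = min(range(len(basis)),
            key=lambda i: abs(inner(basis[i], p)) / abs(inner(basis[i], psi)))
    return labels[k]


def disagreement_fraction(n_samples=5000, seed=3):
    """Fraction of sampled points P at which the two eigenbases give different D values."""
    rng = random.Random(seed)
    c = s = math.sqrt(0.5)
    labels = ["D_k", "D_k", "D_l"]            # first two eigenkets are degenerate
    basis_a = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    basis_b = [[c, s, 0], [-s, c, 0], [0, 0, 1]]  # mixes only the degenerate pair
    psi = [0.6, 0.48, 0.64]                   # normalized state (assumed example)
    differ = 0
    for _ in range(n_samples):
        # P sampled from the weight (8).
        p = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(3)]
        if d_value(basis_a, labels, p, psi) != d_value(basis_b, labels, p, psi):
            differ += 1
    return differ / n_samples


if __name__ == "__main__":
    print(disagreement_fraction())
```

A fraction clearly above zero indicates that the two choices of eigenbasis disagree on a set of non-zero weight, in line with the statement proved in this Appendix; the numerical value depends on the assumed state and mixing angle.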


1. — Introduction.


Basic statistical problems in physics (e.g., those connected with statistical
mechanics and the quantum theory) have, as in other fields, so generally been
given a mathematical formulation in terms of the theory of probability, that
the impression is created that perhaps no other kind of formulation of such
problems is even possible. In the present paper, however, we wish to raise
the question of whether the theory of probability really represents the most
adequate possible mathematical instrument in all of those cases to which it
is customarily applied. Indeed, as we shall see in this paper, there is a wide
range of statistical applications in physics, in which a more reliable and a clearer
treatment can be obtained by using methods other than those of the theory


(*) Now on leave of absence at Technion, Haifa Institute of Technology, Haifa, 
Israel. 

of probability. In the study of such applications, one sees that what is
relevant is not so much the theory of probability, but rather what we call « the
general statistical problem », of which the problem of probability is only a
special case.

The general statistical problem may be stated thus: To find those rela- 
tionships which hold in a statistical aggregate independently of a wide range 
of variations of the detailed properties of the individual members of the aggre- 
gate, the size of this range of variation depending on the physical conditions 
and the context of interest. 

It is evident that if relationships of the type described above hold, then 
whatever may be the character of these relationships, we shall be able to make 
statistical predictions concerning the aggregate without having to treat the 
problem of just what the individual members are doing. In those cases to which 
the theory of probability applies, such statistical predictions can be made with 
the aid of the well known probability calculus; but as we shall show in this 
paper, there are a number of rather simple problems, in which it is highly 
advantageous to go outside the scheme of expressing the statistical relationship 
in terms of nothing but the concept of probability. These examples will also 
provide further insight into the meaning of the concept of probability; for 
they enable us to see this concept inside of the broader setting of statistical 
laws in general.

After formulating the general statistical problem with the aid of certain 
examples drawn from the field of statistical mechanics, we shall focus our 
attention on a few cases to which the theory of probability applies. Here, 
we shall discuss the problem of how probabilities are related to determinate 
laws, and even to a more general class of laws lying between the determinate 
and the probabilistic types. Such a process of relating probabilistic laws to 
other types of laws will also enable us to see more clearly what is the nature 
of randomness. Moreover, as we shall see, studies of this kind lead us into 
new kinds of problems that cannot even be formulated in terms of the theory 
of probability itself, such as that of estimating in specific cases the error and 
domain of applicability of various aspects of the theory of probability. 

The general plan followed in this paper will be first to review some of the
basic quantitative ideas on the theory of probability that have already been 
current, then to go on to present the « general statistical problem », and then 
to specialize to the problem of probability. We wish to make it clear at the 
outset, however, that an appreciable number of points appearing in this paper 
will be a repetition, in our own terms of course, of ideas that have already 
been published elsewhere. This repetition is, however, essential for the clarity 
of presentation of the material, because our principal conclusion is really that 
it is just the combination of these various ideas, which have appeared mainly 
in a scattered form until now, combined with our own results, which indicates 
the possibility of a more general point of view toward statistical problems than
has thus far generally been considered.


2. - Chance and Statistical Law. 


The most important and characteristic application of the theory of proba- 
bility is to a statistical aggregate of objects, events, or phenomena subject 
in some way to the action of « chance ». To know what probability means, it 
is therefore necessary to have a definition of chance. We shall give here such 
a definition (*). 

In order to bring out what is meant by chance, we may consider a typical
chance event; namely, an automobile accident. Now, it is evident that just
where, when and how a particular accident occurs depends on an enormous
number of more or less equally important factors, a slight change of any one
of which could greatly change the character of the accident, or even avoid
it altogether (+). Hence, relative to a context in which we consider, for example,
the precautions that can be taken by a particular motorist, each accident has
an aspect that is fortuitous. By this, we mean that what happens is contingent
on independent factors occurring outside the context in question, which have
no essential relationship to the characteristic traits that define just what sort
of a person this motorist is and how he will behave in a given situation. For
this reason, we say that relative to such a context, a particular collision is not
an inevitable development, but rather that it is an accident and comes about
by chance, from which it also follows that within this context, the question of
just where, when, and how such a collision will take place, as well as that of
whether it will take place or not, is unpredictable.

Let us now consider a series of similar accidents. Here, we note that there
is a complicated and unpredictable variation or fluctuation in the precise details
of the various accidents (e.g., precisely when and where they take place,
precisely what is destroyed, etc.). The origin of this variation is easily understood,
since as we have seen, the numerous causes on which the accident depends
are essentially independent of each other and of anything that appears in the
context of the actions taken by the particular motorist under discussion. Since


(*) After we had finished this work, Prof. MARIO SCHÖNBERG called our attention
to the fact that some of the ideas on chance and probability appearing in this paper
had already been suggested by COURNOT (Exposition de la théorie des chances et des
probabilités, Paris, 1843).


(+) E.g.: one of the motorists could have started ten seconds earlier or later, or
slowed down to avoid a cat that happened to cross the road, or he could have given
the wheel a slightly different turn, etc.

each of these essentially independent sets of causes will in general act at
different and not systematically related times and places, and in different and
not systematically related ways in all other respects, the net result will be 
just the complicated fluctuations in the details of different accidents which 
we have described above, fluctuations which will be unpredictable in the context 
under discussion (although they are in principle predictable in a sufficiently 
broad context). 

As the number of accidents under consideration becomes larger and larger, 
however, new properties begin to appear; for one finds that individual va- 
riations tend to cancel out and regular statistical trends begin to show them- 
selves. Thus, the total number of accidents in a particular region generally 
does not change much from year to year; and the changes that do take place 
often show a regular systematic trend. Moreover, this trend can be altered 
in a systematic way by the alteration of specific factors on which accidents 
depend. Thus, when laws punishing careless driving are enforced, or when 
regular inspection of mechanical parts is required, the mean accident rate in 
a given region always undergoes a downward trend. In the case of an indi- 
vidual motorist taking a particular trip, no very definite predictions can in 
general be made concerning the effects of such measures, since there are still 
an enormous number of causes of accidents that have not yet been eliminated; 
yet statistically, as we have seen, variations in a particular cause produce a 
regular and predictable trend in the effect. 

The behavior described above is found in a very wide range of fields inc- 
luding social, economic, medical and scientific statistics, and a whole host of 
other applications. In all these fields, the following are the most important 
characteristics: 


1) Each event or result depends contingently on a very large number 
of essentially independent causal factors lying outside the context under dis- 
cussion. 


2) These causal factors all have comparable orders of importance (*) 
(although they will not in general be exactly equal in their effects). 


3) These causal factors vary sufficiently so that in a series of similar 
events, the events of interest fluctuate in a way the details of which cannot 
be predicted from any data that can be taken solely in the narrower context 


(*) If the effects of a few of the factors are many orders of magnitude more
important than the effects of all the others combined, then these factors would be
regarded as operating systematically. The event in question would still be contingent
on these causal factors, but it would be inappropriate to describe their effects as
being those of chance.

under discussion. It can be predicted, however, that in a long enough series
of events, every possibility will be brought about in these fluctuations. More- 
over, the more sensitively dependent the event of interest is on the external 
causal factors the more favorable are the conditions for the creation of chance 
fluctuations of appreciable importance. 


4) In an aggregate containing many members, the chance fluctuations 
tend to cancel out, so that the average or long-run properties can be predicted 
approximately on the basis of statistical laws. Such laws can be applied without 
the need to go to the broader context in which one would take into account 
the additional causal factors that govern the details of the fluctuations of the 
individual members of the aggregate. 
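
Characteristic 4 can be made concrete with a toy simulation. The sketch below assumes that each trip independently ends in an accident with a small fixed probability (a deliberately crude stand-in for the many independent causal factors described above) and compares the relative year-to-year spread of the accident totals for a small and a large aggregate. The probability, the numbers of trips and years, and the function names are all illustrative assumptions, not data.

```python
import random
import statistics


def yearly_totals(n_trips, p_accident, n_years, rng):
    """Number of accidents in each simulated year of n_trips independent trips."""
    return [sum(1 for _ in range(n_trips) if rng.random() < p_accident)
            for _ in range(n_years)]


def relative_spread(n_trips, p_accident=0.05, n_years=30, seed=4):
    """Standard deviation of the yearly totals divided by their mean."""
    rng = random.Random(seed)
    totals = yearly_totals(n_trips, p_accident, n_years, rng)
    return statistics.pstdev(totals) / statistics.mean(totals)


if __name__ == "__main__":
    for n in (100, 10000):
        print(n, "trips per year: relative spread", round(relative_spread(n), 3))
```

In this model the relative spread falls roughly as the inverse square root of the number of trips, so the totals of the larger aggregate are far more regular even though no individual trip has become more predictable.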


3. — The Concept of Probability 


Historically, the concept of probability was first given a precise form in 
connection with gambling games. A good example is furnished by the game 
of dice. Now, if we follow the results of each individual throws of the dice, 
we discover that the results fluctuate from one throw to the next, in the same 
characteristic way that is demonstrated by a wide range of chance phenomena 
of the type discussed in Sec. 1. Nevertheless, gamblers have developed the 
custom of betting on a given combination and of giving certain odds that 
depend on the combination in question. Experience has shown that corres-
ponding to each possible combination, there seems to exist a set of «fair odds»,
such that if these odds are offered, then in the long run the gambler will neither
win nor lose systematically.

Now the problem that was attacked by the earliest mathematicians (*)
who turned their attention to this question was to find a theoretical way of
calculating what these «fair odds» should be. In the case of throws of a die,
for example, this problem was solved by supposing that all six faces of each
die are «equally likely» in each throw. Thus, the probability that a given
die will come out a five is 1/6, and since the dice are «independent», the pro-
bability that both will come out fives is 1/6 × 1/6 = 1/36. Hence, the «fair
odds» are 36 to 1 in this case.
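
The counting argument can be checked by direct enumeration of the equally likely outcomes. This is a minimal sketch; the helper `probability` and the variable names are introduced here for illustration only.

```python
from fractions import Fraction
from itertools import product

FACES = range(1, 7)

def probability(event, dice=2):
    """Fraction of the equally likely outcomes of `dice` throws
    for which `event(outcome)` is true."""
    outcomes = list(product(FACES, repeat=dice))
    favorable = sum(1 for outcome in outcomes if event(outcome))
    return Fraction(favorable, len(outcomes))

p_single_five = probability(lambda o: o[0] == 5, dice=1)
p_double_five = probability(lambda o: o == (5, 5))
print(p_single_five, p_double_five)  # 1/6 1/36
```

The agreement of `p_double_five` with the square of `p_single_five` is just the multiplication rule for independent dice used in the text.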

Although the method of solution of the problem indicated above certainly 
worked in connection with games of chance, it involved the introduction of 
the rather vague notion of equal «likelihood» or «equiprobability» of the
various possible results of a throw. This notion contains a mixture of two


(*) The earliest mathematical discussions of probability seem to have been given
by PASCAL, FERMAT, HUYGHENS, BERNOULLI and LAPLACE.
very different interpretations of probability, which we may call respectively
the « subjective » and the « objective ». For the sake of clarity of presentation, 
we shall give here, however, the pure forms of these interpretations, which 
developed only some time after the concept of probability first began to be 
used systematically. 

In the subjective interpretation of probability, it is supposed that proba- 
bilities represent, in some sense, incomplete knowledge or information con- 
cerning the events, objects, or conditions under discussion. Thus, in the game 
of dice, we have no way of knowing with certainty before the dice are thrown 
what the results of each individual throw will be (since these results are de-
termined by the initial positions and velocities of the various parts of each
die which are not accessible to us in practice). Hence, if the dice are sym- 
metrically constructed, we have no knowledge favoring the suggestion that 
we will obtain any one side instead of another, and we therefore assign equal 
probabilities to each side. In this point of view, then, probability is regarded 
as something that measures or reflects our degree of knowledge or ignorance. 
It is therefore essentially a description of a state of mind, and as a result, an 
essentially subjective category, since it would cease to be necessary, or even 
to have any meaning, if we could obtain precise knowledge concerning the 
initial motions of the dice in each throw. 

It is clear, however, that the interpretation of probability as nothing more 
than a description of our mental reflexes under conditions of incomplete know- 
ledge is not adequate in typical statistical problems, such as those arising in 
applications in science and in other fields. For such an interpretation gives 
us no notion at all of why in statistical aggregates containing many members, 
the actual relative frequency is generally close to the numerical value of the pro-
bability (*). The mere fact that we do not know any reasons concerning either
the structure of the die or its initial motion that would favor one face
of a die over another evidently does not by itself necessarily imply approx-
imately equal relative frequencies for each face. Thus, among the things that
we do not know about the initial motions of the die, there could conceivably
exist hidden factors tending to favor one result over another, even if we know 
that the die is well-balanced in its construction. Such tendencies could evid- 
ently exist whether we knew of them or not; and indeed it is just the non- 
existence of these tendencies that is necessary for the characteristic regular 


(*) The importance of the relative frequency in connection with probability was
first recognized by ELLIS (On the Foundations of the Theory of Probabilities, in Trans.
of Camb. Phil. Soc., 8, 1-6 (1844)), (paper read in 1842), and then by COURNOT (Expo-
sition de la théorie des chances et des probabilités, Paris, 1843). However, neither of
these authors fully eliminated the subjective point of view. VENN (The Logic of Chance,
London and New York, 1866) advocated systematically a pure frequency interpretation.
behavior of the relative frequency in the long run. Thus, we have to do
here with an objective property of the die and of the process by which it is 
thrown (*), which has no essential relationship to the degree of knowledge 
or ignorance of this process that may exist in our minds. Indeed, even if we 
somehow were able to obtain perfect knowledge of the initial conditions in 
each case, this would not change the fact that in a typical long series of throws, 
the probabilities would still provide a good approximation to the relative fre- 
quencies predicted on the basis of such perfect knowledge, (provided of course 
that the die is well-balanced and thrown by a process that leaves room for a 
sufficient degree of chance fluctuation in the initial conditions). Thus, the 
probabilities are objective properties of the die and of the process by which
it is thrown, which manifest themselves approximately as relative frequencies in
a long series of throws. However, the relative frequency does not approach
the probability uniformly, but fluctuates within bounds that tend to come closer
and closer to each other as the number of terms increases.
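
The non-uniform approach of the relative frequency can be seen in a simulated series of throws. The sketch below assumes a pseudo-random generator as a stand-in for the physical throwing process; the function name `worst_deviation_by_decade`, the seed, and the grouping of throws into decades are illustrative choices made here.

```python
import random

def worst_deviation_by_decade(n_throws=100_000, face=5, seed=1):
    """Largest |relative frequency - 1/6| observed for `face` within each
    decade of throws (10..100, 101..1000, ...), keyed by the decade's
    upper bound."""
    rng = random.Random(seed)
    hits = 0
    bound = 100
    worst = {}
    for k in range(1, n_throws + 1):
        hits += rng.randint(1, 6) == face
        if k > bound:
            bound *= 10
        if k >= 10:
            deviation = abs(hits / k - 1 / 6)
            worst[bound] = max(worst.get(bound, 0.0), deviation)
    return worst

for upper, deviation in sorted(worst_deviation_by_decade().items()):
    print(upper, round(deviation, 5))
```

Within any one decade the frequency wanders up and down, but the bounds within which it wanders are much narrower in the last decade than in the first.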

It is clear that the objective interpretation of probability described above
is the one that is appropriate in statistical problems such as those treated in 
science and in other fields. Of course, the word « probability », as commonly 
used, can also be given a subjective meaning in terms of our judgements re- 
garding the likelihood that inferences based on incomplete information are 
valid. Nevertheless, as we have seen, the predictable regular trends in the 
relative frequencies are present in statistical aggregates of objects or events 
distributed by chance processes, whether we make inferences about them or 
not; so that the introduction of our degree of knowledge or ignorance is quite 
irrelevant to the problem under discussion. 

With regard to the role of chance in providing the general conditions or 
background in which the concept of probability can correctly be applied, this 
follows immediately from the discussion of the origin of statistical law given 
in Sec. 2. Thus, in the case of the throws of a die, we have all of the character-
istic factors tending to favor the creation of chance fluctuations. First of all,
there is the relatively great sensitivity of the die to the precise character of
the initial motions with which it is thrown. If the die is thrown from an
appreciable height, so that it has time to turn one or more times before it 
falls, then a small change of its initial conditions could change the final result 
completely from any one face to any other. Moreover, the human body is 
evidently a very complex system, containing a great many fluctuating ele- 
ments of comparable orders of magnitude. Thus, it is quite natural that these 
fluctuations are such that in the long run, any face of the die can appear. 


(*) Both COURNOT and VENN recognized this fact explicitly. Indeed, the expo-
sition in this section follows closely along the general lines laid down by COURNOT.
And considering the effective independence of the behavior of these elements
relative to the physical state of the die, it is hardly surprising that in the long 
run and in the average, these fluctuations favor no particular face, and lead 
to approximately equal relative frequencies for each. We see then in detail 
how the property of probability is a reflection of various aspects of the entire 
process by which a distribution of chance events comes into being. Thus, 
the probability is a property of this process as a whole, which can be given 
a meaning even prior to the realization of any particular sequence as the 
property of the process as a whole from which the predictable trends in the 
relative frequencies come. 

Finally, a word of caution must be given with regard to the use of the 
theory of probability. For it must be remembered that the theory is valid 
only to the extent that the distribution of objects or events has characteristics
similar to those that would be brought about by the operation of a set of 
genuine chance fluctuations. Such fluctuations depend, for their characteristic 
properties, however, on the independence of the external and contingent causes 
relative to the events under discussion. Now in reality, because everything 
in the universe is interconnected with everything else to some degree, perfect 
independence cannot exist. Nevertheless, in many cases, such as that of throws 
of a die, the degree of interconnection is so slight, that the concept of pure 
chance provides a good approximation. Where the independence is incomplete, 
however, this concept will apply only to a limited degree of approximation; 
and then, many features of the calculus of probability (e.g. the multiplication 
of the probabilities of two events to give the probability that they shall both 
occur) will also apply only approximately. In such cases, we do not have 
pure chance, so that a more precise representation of the statistical laws would 
in general have to go outside the framework of the theory of probability (*). 
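
How the multiplication rule degrades under incomplete independence can be made concrete with a deliberately simple model. The model is an assumption introduced here, not taken from the text: the second die copies the first with probability `coupling` and is otherwise thrown independently, so that each die taken alone still shows a five with probability 1/6.

```python
from fractions import Fraction

def joint_probability_both_five(coupling):
    """P(both dice show five) in the hypothetical model where the second
    die copies the first with probability `coupling` and is otherwise
    thrown independently."""
    p_five = Fraction(1, 6)
    p_second_given_first = coupling + (1 - coupling) * p_five
    return p_five * p_second_given_first

product_rule = Fraction(1, 6) * Fraction(1, 6)
for coupling in (Fraction(0), Fraction(1, 100), Fraction(1, 2)):
    exact = joint_probability_both_five(coupling)
    print(coupling, exact, exact - product_rule)
```

For zero coupling the product rule is exact; for slight coupling it is a good approximation; for strong coupling it fails badly, even though the single-die probabilities are unchanged.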


4. — The Irregular Collectives of von Mises.


The objective interpretation of the theory of probability was given a new
and more precisely defined mathematical formulation by VON MISES (1). Since
this interpretation raised the interesting problem of randomness, and more-
over had an important influence on modern thinking on the subject in other
respects, it will be worth our while to discuss and analyze this point of view
briefly.

The basic idea of von MISES was to introduce the notion of the irregular 


(*) We shall discuss this problem in more detail in Section 9. 
(1) See, for example, R. VON MISES: Wahrscheinlichkeit, Statistik und Wahrheit,
dritte Aufl. (Wien, 1951).
collective (or statistical ensemble) as a model for a long series of chance events,
or for a distribution containing many objects with properties that are subject
to chance variations. This collective was given the following three essential
characteristics:

1) It was assumed to have an unlimited number of members. 


2) In the collective, the relative frequency of every type of object or 
event was assumed to approach a definite number, which was identified with 
the probability of that type of object or event (i.e., the only meaning of pro- 
bability was assumed to be in terms of this relative frequency). 


3) The distribution of objects or events was assumed to have a certain 
property of «randomness » or «lawlessness ». 


To understand the suggestions of VON MISES with regard to the concept 
of randomness, it is convenient to consider the problem of statistical sampling. 
Now, it is well known that in typical distributions of chance events, one can 
draw partial samples by various methods, and that the relative frequencies 
of various classes of events in the samples provide a good approximation to
those in the sequence as a whole. The suggestion of VON MISES is effectively 
then that the possibility of drawing samples of this kind be used as a defi- 
nition of randomness. More precisely, his suggestions amounted to the idea 
that in a random distribution, subsequences drawn out of the collective in 
suitable ways should have the same limiting relative frequencies for each type 
of event as are present in the main sequence (*). Evidently, in a very regular 
sequence, this would not be so. Thus, if instead of throwing a die, we placed 
it on the table in the sequence 1, 2, 3, 4, 5, 6, 1, 2, 3, 4, 5, 6, ..., etc., the limiting
relative frequency for each possible result would clearly be 1/6. Nevertheless, 
if one sampled every sixth throw, one would always get the same result, so 
that the relative frequency of that result would be unity, and not one-sixth, 
as it is in the sequence as a whole. Thus, regular sequences of this type fail 
to possess certain important properties with regard to sampling that experience 


(*) Actually, von Mises developed his definition originally in connection with the 
problem of winning a gambling game by betting on subsequences of plays chosen from 
the main sequence by some predetermined kind of rule or «system ». The fact that in 
real games, such efforts to «beat the game» have always failed is then taken to 
demonstrate the randomness of the sequence of plays. It is evident, however, that 
the possibility of finding a « winning system » in a gambling game depends on whether 
or not samples drawn from the entire sequence of plays will have the same distribution 
of relative frequencies as are present in the sequence as a whole. Thus, the formulation 
of the definition of randomness in terms of the possibility of drawing representative 
samples contains the gambling game as a special case.


THE GENERAL STATISTICAL PROBLEM IN PHYSICS AND THE THEORY OF PROBABILITY LOLS 


has shown to be essential for a random series of events; and it is for this reason 
that von MISES proposes the identification of the property of randomness 
with irregularity, or «lawlessness ». 
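
The example of the die placed on the table in regular order can be reproduced directly. This is a small sketch; the sequence length and the helper `relative_frequencies` are choices made here for illustration.

```python
from collections import Counter
from fractions import Fraction

def relative_frequencies(sequence):
    """Relative frequency of each face 1..6 in `sequence`."""
    counts = Counter(sequence)
    return {face: Fraction(counts[face], len(sequence)) for face in range(1, 7)}

regular = [1 + (k % 6) for k in range(6000)]  # 1, 2, 3, 4, 5, 6, 1, 2, ...
every_sixth = regular[5::6]                   # sample every sixth placement

print(relative_frequencies(regular)[6])       # 1/6
print(relative_frequencies(every_sixth)[6])   # 1
```

The regular sequence has the right relative frequencies as a whole, yet the sample drawn from it is entirely unrepresentative, which is what disqualifies it as a random sequence in the sense of VON MISES.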

The infinite collective of von Mises is essentially a simplified model for 
typical finite but long sequences of chance events. In other words, it is sup-
posed that if one replaces the finite collective by the infinite model, then con-
clusions drawn by using the model will generally apply approximately to the
real finite collective. The infinite collective is evidently an abstraction, which 
cannot exist in reality. But it is assumed by von MISES to be an abstraction 
which enables him to represent in a simplified or «economical » form the 
essential relationships that exist in a sufficiently large number of chance events. 
In this regard he compares his idealized abstract collectives to the introduction 
of the concept of points in geometry, which he likewise regarded as abstractions 
permitting a simplified or economical representation of the essential aspects 
of the geometrical relationships of real objects that are very small. 

The formulation of probability given by von MISES can be criticized on 
several grounds. First of all, one can raise the question as to whether it is 
adequate to identify probability completely, as VON MISES does, with a limiting
relative frequency in the idealized collective. For from the fact that the pro-
bability makes possible approximate predictions of future relative frequencies,
one would be led to conclude that there should exist some property corres-
ponding to the probability, even before the real collective has come into being.
For example, let us consider a game of cards in which the relative frequency 
with which a given card will be obtained in a long series of plays is determined 
approximately in terms of a suitable probability. The question then imme- 
diately arises « From what properties existing before the cards are drawn does 
the approximate predictability of the relative frequency come? ». Surely, it 
cannot come from the fact that the plays of the cards that are potentially capable 
of being made belong to an idealized but non-existent collective, which at 
best may be realized approximately only in the future. Hence, the proba-
bilities ought not to be identified with nothing else but relative frequencies in
the idealized ensemble, if only because such an identification implies that the
approximate predictability of relative frequencies in real ensembles that are
going to be produced in the future comes out of nothing at all existing in the
present. Such an implication is evidently unsatisfactory, since it does not 
take into account the fact that definite conditions in the present are actually 
needed to lead to predictable relative frequencies in the future. Thus, in the 
case of the card plays, the probabilities will give the right answers only if the 
cards are similar in structure and if they are adequately shuffled by some 
chance process. Hence, it is evident that the approximate predictability of 
the relative frequencies comes out of something else existing at present. One 
is therefore led to conclude that the concept of probability needs a broader 
base than can be obtained by identifying it with nothing else but a relative
frequency in an idealized ensemble. 

In this regard, the point of view of VON MISES is a step backwards relative 
to that of COURNOT, who had already made it clear that even before an aggre-
gate of objects or events is realized, the probability reflects various aspects
of the objects and the processes from which the distribution will eventually 
come. 

The second important criticism of the von Mises theory of probability is 
connected with the proposed criterion for randomness. The essential problem
here is to define more precisely what is meant by «irregularity» or «lawless-
ness». Thus, VON MISES already realized that he could not require that the
relative frequency in every possible subsequence be the same as in the main
sequence, since one could, for example in the case of throws of a coin, always
choose that subsequence in which the results came out heads, so that the
relative frequency of heads would be unity, and not one-half, as it is for the
sequence as a whole. He dealt with this problem by requiring that the sub-
sequence be chosen by a method of sampling that is defined before all the
results of a given sequence are available; or more precisely, that the sub-
sequence be chosen by a method that depends only on the position of a given
event in the sequence and on the properties of the events before it. Even so,
however, the criterion is very vague, and cannot be applied in any practical
problem (e.g. one could hardly count the relative frequencies in all possible
subsequences fitting into the above criterion). Moreover, it is by no means
clear that such a criterion is self-consistent, since it hardly seems likely that
every method of sampling falling within its scope will give the same relative
frequencies as in the main sequence.
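
A method of sampling of the admissible kind can be written as a rule that sees only the position of an event and the results before it. The sketch below is illustrative only: the names `place_selection` and `after_a_head` are introduced here, and the alternating sequence is a made-up example of a regular sequence that a single such rule already exposes.

```python
def place_selection(sequence, rule):
    """Subsequence chosen by `rule(position, earlier_results)`.

    The rule sees only the position and the results before it, never the
    result it is deciding whether to keep.
    """
    return [x for k, x in enumerate(sequence) if rule(k, sequence[:k])]

def after_a_head(position, earlier):
    """Admissible rule: keep the throw that follows a head."""
    return position > 0 and earlier[-1] == "H"

def frequency_of_heads(sequence):
    return sequence.count("H") / len(sequence)

alternating = ["H", "T"] * 500
chosen = place_selection(alternating, after_a_head)
print(frequency_of_heads(alternating), frequency_of_heads(chosen))  # 0.5 0.0
```

The rule «keep exactly the heads» cannot be expressed in this form, since it would need the result at the position being judged. The practical difficulty noted in the text remains: passing one such rule, or any finite number of them, does not establish that a sequence passes all of them.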

The problem of defining what one means by «irregularity» or «lawless-
ness» is a particularly thorny one, since in the very act of defining it, one is
inevitably ascribing to it some kind of regularity and law. Indeed, various
members (*) of the school of VON MISES seem to have given up the attempt
to define randomness as lawlessness. Taking as a starting point a sequence
of plays in a gambling game, they propose to define randomness in terms of
a specific and well-defined property of such a sequence; viz., that if a player
systematically follows any one of a countable infinity of rules of play which
they define, he can never «beat» the game. These efforts to define randomness
are in their turn, however, subject to important criticisms (2). Aside from
sharing in the defect of the approach of VON MISES that it is not clear how
they could be applied in practice to a wide range of problems, those criteria


(*) A. WALD, A. H. COPELAND, etc.
(2) J. VILLE: Étude Critique de la Notion de Collectif (Paris, 1939).
for randomness involve a simple transference to all possible statistical problems
of a method that may perhaps be appropriate in the special problem of gamb- 
ling games (*). Finally, there has not yet been given a demonstration that 
these criteria can consistently be applied in all of the domains where the theory 
of probability has been used. 


5. — The Modern Axiomatic School. 


The modern axiomatic school of probability developed by KOLMOGOROFF (3)
and others is based on essentially the same general assumptions concerning 
chance and statistical law that we have discussed in Secs. 2 and 3. Its essential 
new characteristic is its stress on taking the laws of the theory of probability 
as mathematical abstractions, which can be studied separately from the ques- 
tion of the problem of their interpretation, and of their applicability to real 
problems. 

In this approach, the probabilities are assumed without further analysis 
as «primitive » mathematical concepts, while the relationships between them 
are then taken as axioms. The purpose of the mathematical theory of proba- 
bility is then regarded as being the deduction of the logical consequences of 
these axioms in progressively greater detail. This procedure is compared with 
that used in geometry, where one likewise introduces certain abstract entities 
(e.g., points and lines) as primitive concepts not subject to further analysis, 
and where the function of the theory is likewise conceived of as the deduction
in ever greater detail of the logical consequences following from a certain 
assumed basic set of axioms, by which it is assumed that the primitive con- 
cepts are related. 

The principal advantage of the axiomatic point of view is then that
without identifying probability prematurely and excessively narrowly with 
any specific property, such as relative frequency either in a real or in an ideal- 
ized ensemble, it enables one to obtain a clear and precise formulation of the 
purely mathematical part of the theory. 

Certainly if one simply considers such a process of abstraction in itself, 
no criticism whatever could possibly be made against carrying it out within 
the domain of pure mathematics, provided that it is clearly understood that 


(*) As we shall see in Section 9, there are strong reasons to suppose that the criteria 
for randomness may have to possess a much wider range of adjustability to the specific 
characteristics of the various statistical problems to which they are applied, than is 
possessed by the criteria that we are discussing here. 

(3) A. KOLMOGOROFF: Grundbegriffe der Wahrscheinlichkeitsrechnung, in Erg. d. Math.,
2, n. 3.
perhaps under other conditions and in connection with new kinds of statistical
problems, new kinds of mathematical abstractions may be needed (just as
the abstractions of Euclidean geometry may have to be replaced by the more 
general ones of non-Euclidean geometry, to deal with astronomical domains 
of space). However, the position taken by many members of the modern
axiomatic school seems to imply that the calculus of probability already re- 
presents the most general possible purely mathematical means of treating 
statistical problems. Such a point of view tends to confine the horizon of 
our thinking about statistical problems to nothing more than the calculation 
of one kind of probability in terms of another kind. On the other hand, as 
we shall see in the later sections of this paper there exists a wide variety of 
actual cases in which arise much more general statistical problems that can 
advantageously be treated by methods going beyond those of the theory of 
probability and that in many cases must be treated by such methods. Thus, 
it is our opinion that while the current theory of probability may consistently 
be regarded as a self-contained logical discipline, it may also be studied both 
mathematically and in more general ways, within the context of a broader 
point of view, in which the meaning of its basic concepts becomes clearer, as 
one comes to see their setting in the general domain of statistical problems, 
as well as the processes by which the laws of probability come to be valid, 
and the factors limiting the domains of validity of these laws. 


6. — On the General Statistical Problem as It Arises in Statistical Mechanics.


Thus far, we have discussed the treatment of statistical problems in terms 
of the concept of probability. We shall now see that in the domain of sta-
tistical mechanics, a much more general notion of what constitutes a statistical
problem is not only advantageous, but in fact, in many problems, absolutely 
necessary. 

In order to help bring out our point, we first review a few elementary ideas 
of statistical mechanics. Now, as is well known, the basic object of statistical 
mechanics is to predict various macroscopic properties of a system composed 
of a very large number of molecules (of the order of $10^{23}$), on the basis of sui-
table hypotheses concerning the motions of its constituent molecules. To
describe the macroscopic properties, we may divide the six dimensional space
having (x, p) as coordinates (which was called μ space by EHRENFEST) (*) into
cells that are so small that x and p do not change appreciably within them.
Now if we consider a system composed of many molecules, the macroscopic


(*) P. and T. EHRENFEST: Encyklop. der Math. Wiss., 4, 4 (Leipzig, 1907-1914).
properties of this system can be specified by giving the number of molecules $n_i$
in each cell. Such a specification involves giving a series of numbers
$(n_1, n_2, \ldots, n_i, \ldots, n_s) = [n_i]$. In typical problems, the total number of molecules in the
system is so great that the $[n_i]$ are all fairly large, even when the cells are
chosen small relative to macroscopic orders of magnitude.

On the other hand, the basic laws of mechanics work in terms of a micro-
scopic specification of the state of the system. This is done by specifying a
point in the 6n dimensional phase space of the system as a whole (called the
Γ space by EHRENFEST).

An important question that we now consider is that of computing the
volume of Γ space corresponding to a certain set of the $[n_i]$. As is well known,
this problem is of crucial importance in calculating the statistical properties
of the system (°). This volume is


(6.1) $\Omega[n_i] = \dfrac{n!}{n_1!\, n_2! \cdots n_s!}\, \Delta\Omega$,


where $\Delta\Omega$ is the volume in phase space of the cell corresponding to any one
arrangement leading to the same $[n_i]$.

We now consider, as is usually done in statistical mechanics, a region of
phase space lying between two hypersurfaces of constant energy, $E$ and $E + \Delta E$,
where $\Delta E$ is quite small compared with $E$. As is well known, an isolated
system moves along a surface of constant energy, so that if we study the pro-
perties of such a shell of thickness $\Delta E$ and let $\Delta E$ approach zero, we shall
obtain an accurate representation of the properties of the hypersurface on
which such a system moves.

The next problem is to obtain an approximation to the expression (6.1),
which is valid when the $n_i$ are large. This is conveniently done in terms of
the expression for $\ln \Omega[n_i]$. We use Stirling's approximation for $n!$ and the $n_i!$, obtaining


(6.2) $\ln \Omega[n_i] \approx n \ln n + \ln \Delta\Omega - \sum_i n_i \ln n_i$.
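
The quality of the approximation (6.2) can be checked numerically. The sketch below rests on assumptions made here for illustration: the cell volume factor is set to unity (so that its logarithm drops out), the occupation numbers are invented, and the exact logarithm of the factorials is taken from `math.lgamma`.

```python
import math

def ln_omega_exact(occupation):
    """ln( n! / prod(n_i!) ), using lgamma(k + 1) = ln(k!)."""
    n = sum(occupation)
    return math.lgamma(n + 1) - sum(math.lgamma(k + 1) for k in occupation)

def ln_omega_stirling(occupation):
    """Leading Stirling form: n ln n - sum(n_i ln n_i)."""
    n = sum(occupation)
    return n * math.log(n) - sum(k * math.log(k) for k in occupation)

occupation = [4000, 3000, 2000, 1000]   # hypothetical cell populations
exact = ln_omega_exact(occupation)
approx = ln_omega_stirling(occupation)
print(exact, approx, (approx - exact) / exact)
```

For populations of this size the two expressions already agree to within a fraction of a percent, and the agreement improves as the occupation numbers grow.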


Now, $\Omega[n_i]$ has a very steep maximum. Thus, as we shall see presently,
a good approximation to this function can be obtained by evaluating the set
of values of $[n_i]$ for which $\Omega[n_i]$ is a maximum, and then by expanding
$\ln \Omega[n_i]$ as a series of powers of $\delta n_i = n_i - n_{i0}$, where $n_{i0}$ is the value of $n_i$
for which $\Omega[n_i]$ is a maximum.


(°) This problem has been treated in various ways by many authors. See, for
example, P. and T. EHRENFEST: Encyclopédie des Sciences Mathématiques, 4, 2, Sup-
plément II.
To obtain the maximum of $\ln \Omega[n_i]$, subject to the conditions, following
from the isolation of the system, that the total energy is a constant
($\sum_i n_i E_i - E = 0$) and that the total number of particles is a constant
($\sum_i n_i - n = 0$), we use the well known method of Lagrange's parameters, i.e.,


(6.3) $\ln \Omega' = \ln \Omega - \lambda \bigl(\sum_i n_i - n\bigr) - \mu \bigl(\sum_i n_i E_i - E\bigr)$.


The condition for a maximum is then


(6.4) $\dfrac{\partial \ln \Omega'}{\partial n_i} = -\lambda - 1 - \mu E_i - \ln n_i = 0$,

or

(6.5) $n_i = n_{i0} = A \exp[-\mu E_i]$.


As is well known from calculations carried out in statistical mechanics, $\mu$ comes
out equal to $1/kT$, where $k$ is Boltzmann's constant and $T$ the temperature,
while $A$ is a normalizing constant chosen to yield $\sum_i n_i - n = 0$.


For the case of molecules with negligible interactions, to which we shall
restrict the discussion, the energy is just the kinetic energy, $E_i = \tfrac{1}{2} m v_i^2$, so
that we obtain


(6.6) $n_{i0} = A \exp[-m v_i^2 / 2kT]$.


This is the well-known Maxwell distribution of velocities. 
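
The maximum property of the exponential distribution (6.5) can be verified numerically for a small model system. Everything specific in the sketch is an assumption made here for illustration: five equally spaced cell energies, the totals of particles and energy, the bisection search for the multiplier, and the particular rearrangement used to test the maximum.

```python
import math

def boltzmann_occupation(energies, n_total, e_total):
    """Occupation numbers n_i = A exp(-mu E_i) meeting both constraints."""
    def mean_energy(mu):
        weights = [math.exp(-mu * e) for e in energies]
        return sum(w * e for w, e in zip(weights, energies)) / sum(weights)

    lo, hi = 1e-9, 50.0               # mean energy decreases as mu grows
    target = e_total / n_total
    for _ in range(200):              # bisection on mu
        mid = 0.5 * (lo + hi)
        if mean_energy(mid) > target:
            lo = mid
        else:
            hi = mid
    mu = 0.5 * (lo + hi)
    weights = [math.exp(-mu * e) for e in energies]
    a = n_total / sum(weights)        # normalizing constant A
    return mu, [a * w for w in weights]

def ln_omega(occupation):
    """Stirling form of ln(Omega), with the cell volume factor omitted."""
    n = sum(occupation)
    return n * math.log(n) - sum(k * math.log(k) for k in occupation)

def shifted(occupation, delta):
    """Rearrange particles among cells 0, 1, 2; for equally spaced
    energies this leaves both the particle number and the energy fixed."""
    out = list(occupation)
    out[0] += delta
    out[1] -= 2 * delta
    out[2] += delta
    return out

energies = [0.0, 1.0, 2.0, 3.0, 4.0]
mu, n0 = boltzmann_occupation(energies, n_total=10_000, e_total=8_000)
print(mu, [round(k, 1) for k in n0])
print(ln_omega(n0), ln_omega(shifted(n0, 50)), ln_omega(shifted(n0, -50)))
```

Any rearrangement that respects both constraints lowers the logarithm of the volume, in whichever direction it is made, as expected at a constrained maximum.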

We see then that the Maxwell distribution corresponds to the largest pos-
sible volume of Γ space. We shall now be interested, however, in the volume
of Γ space corresponding to small deviations from the Maxwell distribution.
To calculate these volumes, we expand eq. (6.3) up to second powers in
$\delta n_i = n_i - n_{i0}$, obtaining


(6.7) $\ln \Omega' = \ln \Omega_0 - \tfrac{1}{2} \sum_i (\delta n_i)^2 / n_{i0}$,


(6.8) $\Omega' = \Omega_0 \exp\bigl[-\tfrac{1}{2} \sum_i n_{i0} (\delta f_i)^2\bigr]$,


where $\delta f_i = \delta n_i / n_{i0}$ is the fractional deviation from a Maxwell distribution.

We see then that deviations from a Maxwellian distribution have a gaussian
density in phase space. Indeed, because $n_{i0}$ is large, the volume corresponding
to appreciable fractional deviations is, in general, very small. In other words,




the maximum of $\Omega[n_i]$ is a very steep one (*). Indeed, it is so steep that one
may express the result more pictorially by saying that a surface of constant
energy corresponds to a Maxwellian ocean in which there are a few very tiny
non-Maxwellian islands (+). Now, the usual treatment in statistical mechanics
has been to assume that every point on the shell of phase space, in which the
energy has a value between $E$ and $E + \Delta E$, is equally probable. Such an as-
sumption would evidently lead to the correct conclusion that the system
practically always has a distribution very close to the Maxwellian; for the very
small volume of phase space corresponding to the non-Maxwellian distributions
would then practically never be entered by the phase point.
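
The gaussian form (6.7) and the steepness of the maximum can both be checked on a small example. The numbers below are invented for illustration: a geometric set of occupation numbers (which has the exponential form (6.5) for equally spaced cell energies) and a rearrangement that conserves particle number and energy. The Stirling form of the logarithm is used, with the cell volume factor omitted.

```python
import math

def ln_omega(occupation):
    """Stirling form of ln(Omega), with the cell volume factor omitted."""
    n = sum(occupation)
    return n * math.log(n) - sum(k * math.log(k) for k in occupation)

def gaussian_ln_ratio(n0, dn):
    """ln(Omega'/Omega_0) according to (6.7)."""
    return -0.5 * sum(d * d / k for d, k in zip(dn, n0))

n0 = [6400.0, 3200.0, 1600.0, 800.0]  # geometric, i.e. of the form (6.5)
dn = [40.0, -80.0, 40.0, 0.0]         # keeps n and E fixed for E_i = 0, 1, 2, 3

exact = ln_omega([k + d for k, d in zip(n0, dn)]) - ln_omega(n0)
approx = gaussian_ln_ratio(n0, dn)
print(exact, approx)

# Same fractional deviations, but every population a million times larger:
scale = 1e6
print(gaussian_ln_ratio([k * scale for k in n0], [d * scale for d in dn]))
```

The exponent grows in proportion to the populations for a fixed fractional deviation, so that for populations of macroscopic size the relative volume of the «islands» becomes utterly negligible.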

The question of the justification of the hypothesis of equiprobability on 
a shell of constant energy is discussed in texts on statistical mechanics (5). 
This question is, however, not primarily what interests us in this paper. Hence,
we shall here content ourselves with the statement that such a justification 
usually involves attempts to prove either a quasi-ergodic theorem or metric- 
indecomposability of the orbits in phase space, attempts which thus far have 
not been successful except for a few of the simplest types of systems (7). The 
point that interests us here, however, is that the attempt to prove equi-proba- 
bility may have little or no relevance to the problem of demonstrating the 
Maxwellian distribution. For because of the fact that the non-Maxwellian 
distributions correspond to such a minute volume of phase space, it is clear 
that almost any continuous probability distribution, uniform or otherwise, 
would lead to a distribution very close to a Maxwellian distribution. But 
more important even than this is the fact that a very wide range of deter- 
minate laws governing an individual system would still give an approximately 
Maxwellian distribution, for practically all of the time, as long as they did 
not imply an unusual tendency for the orbit to remain near the special
non-Maxwellian regions. And since, as we shall see in Sec. 7, we have a very large
number of particles in unstable motion, it is quite plausible to expect that
in systems of the kind with which we work in statistical mechanics, the actual
orbit in phase space will tend to wander sufficiently throughout the accessible
regions of phase space, so that it will not long remain in the non-Maxwellian
regions.


(*) A much more elegant and precise method is available for deducing this result.
See, for example, the proof of the central limit theorem in A. I. KHINCHIN: Statistical
Mechanics (New York, 1949). The method that we have chosen here is, however, based
on essentially the same ideas, and has the advantage of bringing out more clearly what
is important for our purposes.

(+) This fact has long been well known. See, for example, reference (5).

(5) See, for example, A. I. KHINCHIN: Statistical Mechanics (New York, 1949).

(7) We shall discuss a few such cases in Section 9.

We conclude then that the effort to base statistical mechanics on
equiprobability in a given region in phase space is an unnecessarily narrow way
of attacking the problem. In some cases (if there is time for the orbit to fill
such a region to a good degree of approximation), it could be based on
equiprobability (*). But in most problems, the real basis seems to be the
insensitivity of large-scale averages in systems containing many components to the
precise details of the motions of these components. Because of this
insensitivity, one may have practically constant and determinate statistical averages,
without having equidistribution, and indeed without having any kind of
probability distribution at all (*). Thus, the problem of finding an adequate
basis for statistical mechanics is greatly simplified, because it is evidently
much easier to prove that the phase point does not remain long in the tiny
non-Maxwellian regions than it is to prove that it covers the whole space in
a quasi-ergodic or metrically indecomposable way.

On the basis of the above results, we are then led to consider a new and 
broader point of view on statistical problems, which permits us to go beyond 
the mere calculation of one kind of probability in terms of another kind. 
Instead, we may consider what has been called in Sec. 1 the «general statis- 
tical problem ». This is to find which properties of a statistical aggregate are 
approximately independent of the variations that actually occur in the detailed 
behavior of the individual systems composing the aggregate under the con- 
ditions that define the problem. The proof of the quasi-ergodic theorem and 
theorems in metric indecomposability would then be special cases of solutions 
of this problem, because where these theorems are valid, the long-run pro- 
perties of the orbit in phase space are essentially independent of initial con- 
ditions. Thus, predictions of the mean behavior become possible without a 
detailed specification of the state of an individual system. 

We shall now further explain some of the consequences of the broader and 
more general attack on statistical problems that we have described above. 
In this connection, we have thus far considered two extremes, one of which 
is to deduce the statistical properties from probabilities, and the other of which 
is to deduce them from precise determinate laws governing the individual 


(*) In the problem that we are considering here, there is not enough time for this
to happen. Thus, as a consequence of a theorem proved by POINCARÉ (Acta Math.,
13, 67 (1890)), in a typical mechanical system, containing something like $10^{22}$
molecules, it would take a fantastically long time, of the order of trillions of trillions of
years, for the orbit to come reasonably near to every point of phase space.

(*) This point seems to have been known for a long time to a number of writers
in the field (see, for example, A. I. KHINCHIN: Mathematical Foundations of Statistical
Mechanics, sec. 13). However, as far as we can tell, its full significance with regard to
the lack of necessity of using a probabilistic treatment of statistical mechanics does
not seem to have been realized.

systems. We may, however, consider the possibility of deducing the statistical
properties from intermediate types of laws, which are neither completely
probabilistic nor completely deterministic. The need for such intermediate
types of laws can be seen by considering the fact that no system can be
described completely in terms of any finite number of properties. First of all,
any system is always interacting with other systems outside of it. In many
cases, these interactions seem to be small. But because the detailed motions 
of individual systems are unstable, they can in the long run, be affected a 
great deal even by small disturbances arising outside. Secondly, laws ope- 
rating at a particular level are generally subject to disturbances arising at 
other levels, not included in the laws in question. For example, the laws of 
the macroscopic level are subject to fluctuations of the type of Brownian motion, 
arising from the motions at the atomic level. But the laws of classical me- 
chanics operating at the atomic level are subject to further fluctuations arising 
from the quantum-mechanical properties of matter. And because of the insta- 
bility of motion, these fluctuations can have important effects on the detailed 
behavior of the orbits even at a level where classical mechanics would
otherwise be a good approximation. Thus, one comes to the conclusion that in any
real problem, in which one necessarily leaves out of consideration an unlimited
number of effects coming from outside the system and from other levels,
a precisely determinate and complete causal prediction cannot be obtained.
In general, therefore, we expect to obtain from our laws predictions only of 
the mean behavior of a system, and of the range and general character of the 
fluctuations away from this mean, arising from the infinity of factors that 
have not been taken into account. Because of the insensitivity of large scale 
averages in statistical mechanics, however, a very wide range of laws of the 
type described above is evidently possible, which lead to approximately the 
same large scale statistical properties (such as the Maxwellian distribution). 

On the basis of considerations of the type discussed above, one is led to 
consider the question of whether one ought not to seek a more general sta- 
tistical formalism (*) and not always to try to frame all statistical problems 
in terms of the calculus of probabilities. Of course, the theory of probability 
has demonstrated its applicability in a wide range of fields, starting with 
gambling games, and going on to the fields of medical, social, industrial,
economic, and scientific statistics. Nevertheless, as we have already seen, in our
discussion of the foundations of statistical mechanics (certainly a study that
falls within the scope of statistical problems), the theory of probability already


(*) This problem would be a mathematical reflection of the general statistical
problem that we are raising here. The notion that new statistical formalisms, other
than the currently used calculus of probabilities, might be possible was first suggested
to us by Prof. MARIO SCHÖNBERG (private communication).

seems to have little relevance. To be sure, it is possible to justify statistical
mechanics on the basis of the hypothesis of equiprobability in phase space,
but in most problems, this hypothesis can be shown not to have much to do
with the real problem (*). What is significant in the real problem is that
large-scale averages are insensitive to most of the details of the molecular motions,
so that for a wide range of possible motions, approximately the same
macroscopic averages are obtained. The fact that one may within a certain context
of problems, obtain correct results using the irrelevant hypothesis of equi- 
probability, is then a consequence of the close relationship between the equi- 
distribution of probability that would arise in an extremely long period of 
time and the properties of the motion that lead to the statistical laws that 
are valid over periods of time of practical interest. One may compare the 
probabilities that would apply in an extremely long period of time to a sort 
of shadow, the behavior of which can more or less faithfully represent some 
of the statistical properties that are relevant in shorter periods of time. Our 
point of view is then that it could be profitable to study the essentials of the 
problem more directly rather than to study nothing more than the proba- 
bilistic « shadow ». 

As an example of how statistical problems can be treated in ways other 
than those based on the theory of probability, let us consider the problem of 
finding the meaning of the entropy on the basis of the molecular properties 
of the system. Now, it has been customary to define entropy in terms of pro- 
bability; and for a broad range of purposes, this definition has served to make 
possible the deduction of the thermodynamic relations and the calculation of 
some of the thermodynamic properties of various systems. But from the work
of P. HERTZ and T. LEVI-CIVITA (7) one sees that entropy may be associated
with mechanical properties of the system rather than purely with probability.
Thus it seems quite likely that new models for the entropy could be worked 
out. We do not wish to imply that either the probabilistic or the mechanical 
model necessarily exhausts all that there may be to the concept of entropy. 
Indeed, still other models may perhaps be proposed later which give us even 
deeper insight into the meaning of entropy. But the mere fact that models 
different from the probabilistic one can lead to similar results in a broad domain 
is a sign that it would not be wise to identify entropy completely with the 
current probabilistic model, just because this model has led to correct results 


(*) Thus, because of the extremely great length of the Poincaré cycles, there is 
no time for a system to come reasonably close to every point on a surface of constant 
energy in phase space. 

(7) P. HERTZ: Weber-Gans, in Repertorium d. Physik, Bd. 1, 2 (Leipzig, 1916);
T. LEVI-CIVITA: Drei Vorlesungen über adiabatische Invarianten, in Abh. d. Math.
Seminars Hamburg, 6, 323 (1928).

in a certain domain. Rather, the way should be left open for new statistical
concepts, and associated mathematical methods, which may shed new light
on the meaning of entropy.

Another important application in which the concept of probability may
have only a rather limited validity is the study of non-equilibrium problems
in statistical mechanics. If we are interested in how a system approaches
statistical equilibrium, then a certain difficulty of principle arises in connection
with the application of the concept of probability. For the statistical
properties of a system (e.g., pressure, temperature, velocity, etc.) are changing
with the time. The usual method of defining probability as the average over
an unlimited period of time then will not work. Of course, if the statistical 
averages are changing slowly, one may define the probability approximately 
in terms of the average over a period of time during which these properties 
do not change very much (8). For rapid changes, however, even this method
does not work. The other possible method of defining probability is through 
a statistical ensemble. But it seems unsatisfactory to have to treat what is 
after all the approach of a single system to equilibrium in terms of an average 
over a non-existent ensemble. Such a treatment may give correct results in 
certain contexts, but again it is rather like studying the motion of a thing in 
terms of the motion of its shadow. Perhaps new statistical methods and 
concepts would here make possible a deeper understanding of the problem of 
the approach to equilibrium in such systems. 

Even if we restrict ourselves to statistical laws that are expressible in terms
of the laws of probability, our point of view shows that one can still treat a
broader range of problems than just the calculation of one kind of probability
in terms of another. For within the setting in which statistical law is regarded
as more general than probability, it has meaning to demonstrate the validity
of the laws of probability in certain contexts, and to calculate the numerical 
values of the probabilities, on the basis of laws that are more general than 
the laws of probability. Thus, as we have seen, we can calculate a probability 
from a determinate law, or from an intermediate type of law, which is partially 
determinate. Vice-versa, a partially determinate law can be obtained from a
probabilistic law, when the number of elements, N, in the distribution is large.
For example, according to Bernoulli's theorem, one can for practical purposes
predict that the error in a certain average will decrease as $N^{-1/2}$. As N
approaches infinity, the partially determinate law will become more and more nearly
a completely determinate law. Thus, we see how the adoption of a broader
point of view toward statistical law permits us to attack the problem of the 
interconnection of various kinds and levels of law. 
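
As an illustrative aside, the $N^{-1/2}$ decrease of the error can be checked numerically in the simplest case. The sketch below assumes N independent trials with success probability p (a binomial law) and computes the exact root-mean-square deviation of the observed fraction of successes from p. The function name `rms_error_of_fraction` and the choice p = 1/2 are ours and do not come from the text.

```python
from math import comb, sqrt


def rms_error_of_fraction(n_trials, p=0.5):
    """Exact RMS deviation of the success fraction k/N from p.

    Assumes N independent trials, each succeeding with probability p,
    so that k follows a binomial law (an assumption of this sketch).
    """
    variance = sum(
        comb(n_trials, k) * p**k * (1 - p) ** (n_trials - k) * (k / n_trials - p) ** 2
        for k in range(n_trials + 1)
    )
    return sqrt(variance)


if __name__ == "__main__":
    for n in (25, 100, 400):
        # Second column: exact error; third column: sqrt(p(1-p)/N) for p = 1/2.
        print(n, rms_error_of_fraction(n), sqrt(0.25 / n))
```

Quadrupling N halves the error, which is the sense in which the partially determinate law approaches a completely determinate one as N grows.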


(8) See, for example, M. SCHÖNBERG: Nuovo Cimento, 10, 419 (1953), where a
discussion is given of the problem of taking averages in a non-equilibrium process.

7. A Further Example of the General Statistical Problem - The Deduction
of a Probabilistic Law from a Determinate Law.


In the previous section, we have seen in terms of an example drawn from
statistical mechanics, in which one deals with a system containing a very large
number of particles, how statistical relationships can be treated without the
need for their being expressed in terms of the theory of probability. In this
section, and in the next, we shall go to the opposite extreme and consider a
system described by a single variable only, satisfying a simple determinate
law, a law in which, however, the motion has the essential characteristic of
extreme instability (*). We shall then see that this law leads to a long run
behavior that is independent of initial conditions, a long run behavior that
is just what is predicted by the theory of probability. We shall also see that,
as happened in the many-particle systems treated in statistical mechanics,
essentially the same kind of long-run behavior will follow from a wide range
of laws that are between the determinate and probabilistic types. Thus, the
justification of the laws of probability in a given application is seen to be a
part of the general statistical problem.

In order to motivate the example that we are going to choose, let us
consider the path of a molecule in a gas, which is contained in a box. We
suppose that this molecule can be represented approximately as a hard elastic
sphere. Now, it is well known that in its collisions with other molecules, this
particular molecule will follow a very complicated path, in which the velocity
undergoes a wide range of rapid fluctuations, both in magnitude and in
direction. In the long run, however, the probability that this particular
molecule will occupy any given small region of space is the same as that it occupy
any other such region having the same volume. Likewise, the probability that
the direction of motion lies in any specific range of angles becomes uniform
in the long run.

Let us now analyze how such a uniform probability comes about. We
first note that the path of any given molecule possesses an extreme instability.
Thus, suppose the initial position of the i-th molecule to be $x_i^0$, and its initial
velocity $v_i^0$. We now consider a collision of this molecule with any other one,
say molecule j. Focussing our attention on the path of molecule i, we see
that the change of angle of motion $\Delta\theta_i$ is evidently very sensitively dependent
on the collision parameter, h (shown in Fig. 1), and therefore on the initial
position and velocity of this particle (+).


(*) POINCARÉ has already pointed out the importance of instability in this problem.
We shall discuss POINCARÉ's work and compare his results with ours at the end of this
section.

(+) It also depends on those of particle j, but in this discussion, we are interested
only in following what happened to particle i.


In fact, if its initial position is changed by an amount of the order of the
size of an atom, its angle of motion after collision will change by an amount
of the order of $2\pi$.


[Fig. 1. Collision with a scattering particle, showing the collision parameter h.]

Since the change of position suffered by the particle between the times $t_1$
and $t_2$ is (as long as no collisions occur between $t_1$ and $t_2$)

$\Delta x_i = v_i\,(t_2 - t_1)$,

it is clear that in a series of collisions, there is a continual multiplication
of the instability, so that after a sufficiently large number of collisions, a
change of initial position of less than the size of an atom can carry the
particle to a very different region of space and to a very different direction
of motion, as shown in Fig. 2.

It is already clear intuitively that under these conditions a very irregular
motion will develop which does not favor any particular region of space. Thus
the equiprobability of all regions of space becomes plausible on the basis of
the kind of motion that exists.

[Fig. 2.]

It would evidently be desirable, however, to prove this equiprobability in
a more rigorous way. Unfortunately, in the problem described above, the
mathematical difficulties attendant on such an exact proof have not yet been
resolved. We shall therefore propose here a simplified example, which permits
an exact treatment, and which still retains the essential property of
instability of motion, that is evidently at the root of the development of an
equiprobability in the distribution of colliding molecules.

In this model, we consider a point, $P_n$, in a two dimensional space. The
coordinates, $x_n$, $y_n$, represent the location of the particle during the
n-th collision, while $\theta_n$ represents its angle of motion after this
collision, as shown in the figure below. We now assume that the particle makes a
new collision after moving a definite distance, L. (In reality, this distance L
fluctuates from collision to collision; but by regarding this distance as a
constant, we are further simplifying the problem.)

[Fig. 3.]
We further assume that in the process of collision, the angle $\theta_n$ is
multiplied by a large integer, K, or that

(7.1)   $\theta_{n+1} = K\theta_n$ .

We see that this hypothesis embodies the essential element of instability,
since a small change of angle $\Delta\theta_n$ produces a much larger change of angle,
$\Delta\theta_{n+1} = K\,\Delta\theta_n$. A change $\Delta\theta_n = 2\pi/K$ produces a change
$\Delta\theta_{n+1} = 2\pi$. But, whenever the angle changes by $2\pi$ it comes back to its
original value. Thus, we should write $\theta_{n+1} = K\theta_n - [K\theta_n]$, where
$[K\theta_n]$ is the largest multiple of $2\pi$ contained in $K\theta_n$.

We see also that our simplified model gives a schematic representation of
the significant aspects of the actual process of collision. First we note that
the angle $\theta_{n+1}$ changes by $2\pi$ when the initial position of the particle just before
it strikes the (n+1)-th obstacle changes by $L\,\Delta\theta = 2\pi L/K$. Now if K is large
it is clear that a small change of position will lead to a change of angle of $2\pi$.
Thus we embody in our model something similar to the extreme sensitivity
of the angle of collision to the collision parameter, that is so important in
real collision processes for bringing about a uniform distribution. In fact,
one could regard this model as more or less equivalent to having relative to
each collision point a uniformly distributed set of obstacles on a circle of
radius L, each of diameter $2\pi L/K$. Of course, in any given collision, the
locations of the effective obstacles will depend on where the previous collision
took place; whereas in the real process they do not. In this sense, our model
is a further simplification; but in the present work, the essential aspect of
instability of motion is the principal feature of the real process that we are
interested in retaining in the model.
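
The multiplication of a small change of angle by K at every collision can be followed exactly with rational arithmetic. The sketch below is only an illustration: angles are measured in units of $2\pi$ (so the circle is the interval from 0 to 1), and the values K = 10, a starting angle of 1/7 of a turn and an initial separation of $1/(3\cdot 10^{12})$ of a turn are arbitrary choices of ours, as are the function names.

```python
from fractions import Fraction


def circle_distance(x, y):
    """Shortest distance between two angles given as fractions of a full turn."""
    d = abs(x - y) % 1
    return min(d, 1 - d)


def separation_history(x0, delta, K, steps):
    """Separation of two orbits of theta -> K*theta (mod one turn).

    The orbits start at x0 and x0 + delta; the returned list holds their
    separation before the first collision and after each of `steps` collisions.
    """
    x, y = x0 % 1, (x0 + delta) % 1
    history = []
    for _ in range(steps + 1):
        history.append(circle_distance(x, y))
        x = (K * x) % 1
        y = (K * y) % 1
    return history


if __name__ == "__main__":
    delta = Fraction(1, 3 * 10**12)
    for n, d in enumerate(separation_history(Fraction(1, 7), delta, 10, 13)):
        print(n, float(d))
```

The separation grows by a factor K at each collision until it is comparable with the full circle, after which the two orbits are no longer related in any simple way.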

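Let us now consider a sequence of collisions, starting with $\theta = \theta_0$. We
have

$\theta_1 = K\theta_0 - [K\theta_0]$,

$\theta_2 = K^2\theta_0 - [K^2\theta_0]$,

. . . . . . . . . .

$\theta_N = K^N\theta_0 - [K^N\theta_0]$.

We then write our sequence associated with N collisions as

$S(N, \theta_0) = \{\theta_0, \theta_1, \theta_2, \ldots, \theta_N\}$.

Note that each $\theta_0$ determines an entire sequence.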

We now divide $\theta$ into elements of size $\Delta\theta$, of small, but fixed dimensions.
And in each sequence $S(N, \theta_0)$ we can ask how many values of $\theta_n$ fall inside
the i-th element (located near $\theta = \theta_i$). This quantity we define as
$\rho(N, \theta_0, \theta_i)\,\Delta\theta_i$.

Now, in order to demonstrate the development of a uniform distribution,
what we must do is to show that for arbitrary $\theta_0$ (with the possible exception
of a set of measure zero) $\rho(N, \theta_0, \theta_i)$ approaches a constant, independent of
$\theta_0$ and $\theta_i$, when N approaches infinity.

There exist well known theorems in the theory of numbers, which permit
us to prove that in our case (as well as in a much wider class of similar
problems), $\rho(N, \theta_0, \theta_i)$ does indeed approach a constant (9). In the present paper,
however, we shall demonstrate our results in another way which has the
advantage of bringing out more clearly the essential features of the processes
leading to equiprobability than is possible with the more abstract methods,
such as those used by WEYL.

Let us first consider a sequence in which the particle arrives in the region
$\Delta\theta_n$ after the n-th collision. We now ask: What would have been the values
of $\theta$ before the n-th collision? One possible set of values could have been a
region of width $\Delta\theta_{n-1} = \Delta\theta_n/K$ near $\theta_{n-1} = \theta_n/K$. But since the physically
significant values of $\theta$ run between 0 and $2\pi$, the particle could also have come
from any one of the regions of similar width located near $(\theta_n/K) + 2\pi M/K$,
where M is an integer running from 1 to K - 1, making K regions in all. Thus,
the possible regions are uniformly distributed around the circle. (See Fig. 4).

[Fig. 4.]

But each one of these regions $\Delta\theta_{n-1}$ could have come from a set of similar
uniformly spaced regions of width $\Delta\theta_{n-2} = \Delta\theta_{n-1}/K = \Delta\theta_n/K^2$. If we
continue this analysis back to $\theta_0$, we discover that the values of $\theta_0$ which can lead
to a given $\Delta\theta_n$ are situated inside $K^n$ uniformly distributed regions, each of
width $\Delta\theta_0 = \Delta\theta_n/K^n$.

Let us now consider any given sequence, $S(N, \theta_0)$, where
$\theta_0$ lies in the range $\Delta\theta_0$, $\theta_1$ in $\Delta\theta_1$, etc. For the sake of
convenience, we choose all of our ranges $\Delta\theta_0, \ldots, \Delta\theta_n$ equal
to each other and equal to $2\pi m/K$, where m is an integer which is much
smaller than K, but large compared with unity. Thus, the unit circle is
divided into K/m regions, each of width $2\pi m/K$.

We now ask « What is the range of $\theta_0$ that can lead to any given sequence,
in which the particle goes through the regions $\Delta\theta_1, \Delta\theta_2, \ldots, \Delta\theta_n$? ». This
quantity, we call the « measure » of our sequence.

Now, as we have seen, there will be K uniformly distributed regions
of width $\delta\theta_0 = \Delta\theta_1/K$ from which the particle started, if it arrived in
the region $\Delta\theta_1$. The total measure of these regions is $K\,\Delta\theta_1/K = \Delta\theta_1$.
But now, if the particle goes through the region $\Delta\theta_2$, there are likewise K
uniformly distributed regions of the angle $\theta_1$, each having width $\Delta\theta_2/K$, from


(9) See, for example, H. WEYL: Math. Ann., 77, 313 (1916).

which the particle could have started its second step. But of these, only m
will be in the region of width $\Delta\theta_1$, in which the particle was actually located
just before this second step. Hence, the measure of those trajectories passing
through both $\Delta\theta_1$ and $\Delta\theta_2$ will be $m\,\Delta\theta_2/K = \Delta\theta_1\,(\Delta\theta_2/2\pi)$. Similarly, the
measure of those trajectories passing through $\Delta\theta_1$, $\Delta\theta_2$ and $\Delta\theta_3$ will be
$\Delta\theta_1\,(\Delta\theta/2\pi)^2$, and that of those passing through $\Delta\theta_0, \ldots, \Delta\theta_n$ will be
$\Delta\theta_0\,(\Delta\theta/2\pi)^n$. Thus,
only a very small fraction of all possible initial values of $\theta_0$ can lead to a
sequence of angles passing through a given set, $\Delta\theta_1, \ldots, \Delta\theta_n$. The problem is
very similar to that of having to get a ball to go through a series of irregularly
spaced holes, if the ball is deflected as it passes through each hole by an amount
that is very sensitive to the distance from the center. To get the ball through
all of the holes, a fantastically good aim would be needed.
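
The measure of a prescribed sequence of regions can be checked by carrying out the backward construction exactly. The sketch below is an illustration under conventions of ours: angles are measured in units of $2\pi$, the circle is cut into K/m regions of width m/K numbered from 0, the first entry of `regions` is the region containing the starting angle, and the function name `sequence_measure` is hypothetical. With these conventions a sequence of k regions should have measure $(m/K)^k$ of the full circle, which is $\Delta\theta\,(\Delta\theta/2\pi)^{k-1}$ in the units of the text.

```python
from fractions import Fraction


def sequence_measure(K, m, regions):
    """Exact fraction of the circle from which the orbit visits `regions` in order.

    regions[0] contains the starting angle, regions[1] the angle after the
    first collision, and so on. K must be a multiple of m.
    """
    if K % m != 0:
        raise ValueError("K must be a multiple of m")
    width = Fraction(m, K)

    def bounds(i):
        return i * width, (i + 1) * width

    # Start from the last region and pull it back through theta -> K*theta.
    intervals = [bounds(regions[-1])]
    for i in reversed(regions[:-1]):
        lo, hi = bounds(i)
        pulled_back = []
        for a, b in intervals:
            for M in range(K):  # the K uniformly spaced preimages of [a, b)
                pa, pb = max((a + M) / K, lo), min((b + M) / K, hi)
                if pa < pb:  # keep only the part lying in the required region
                    pulled_back.append((pa, pb))
        intervals = pulled_back
    return sum((b - a for a, b in intervals), Fraction(0))


if __name__ == "__main__":
    print(sequence_measure(K=12, m=3, regions=[0, 2, 1]))  # compare with (3/12)**3
```

At each backward step only m of the K preimages of an interval fall in the required region, which is the counting used in the paragraph above.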

Our next step is to find the measure of those values of $\theta_0$ that can lead
to a given distribution, $N_i = \rho(N, \theta_0, \theta_i)\,\Delta\theta$.

Now, as we have seen, any given sequence has a measure of $\Delta\theta\,(\Delta\theta/2\pi)^N$.
We must now count all sequences that can lead to a given $\rho(N, \theta_0, \theta_i)$ and
therefore to a given set of $N_i$. This number is (*)

$\dfrac{N!}{N_1!\,N_2!\,\ldots\,N_i!\,\ldots}$ .

Thus, the measure of those values of $\theta_0$ leading to a given distribution of the
$N_i$ is

(7.3)   $M = \dfrac{N!}{N_1!\,N_2!\,\ldots\,N_i!\,\ldots}\,(\Delta\theta/2\pi)^N\,\Delta\theta$ .

Using Stirling's approximation, we get

(7.4)   $\ln (M/\Delta\theta) \simeq N \ln N - \sum_i N_i \ln N_i + N \ln (\Delta\theta/2\pi)$ .

We obtain the result that the distribution having the highest measure is

(7.5)   $N_i = \text{constant} = \bar{N} = N\,\Delta\theta/2\pi$ .


(*) Here, we make use of the fact that if any given sequence, $S(\theta_0, N)$, is possible
then any other sequence involving an exchange of any two of the angles is also
possible. This follows from the large value of the integer, K, which has as a consequence
that given any values of $\theta_n$, we can always obtain any desired value of $\theta_{n+1}$, simply by
a small change of $\theta_n$ that does not take it out of the region $\Delta\theta_n$ of width $2\pi m/K$.

By defining $\delta N_i = N_i - \bar{N}$, we obtain

(7.6)   $\ln M = \ln M_0 - \sum_i (\delta N_i)^2/2\bar{N} = \ln M_0 - \sum_i 2\pi(\delta N_i)^2/2N\Delta\theta$ .

Writing for the fractional deviations, $\delta f_i = \delta N_i/\bar{N} = 2\pi\,\delta N_i/N\Delta\theta$, we get

(7.7)   $M \simeq M_0 \exp\left[-\sum_i \tfrac{1}{2}(\delta f_i)^2\,(N\Delta\theta/2\pi)\right]$ .

We see then that the measure of those distributions in which $\delta f_i$ is
appreciable becomes negligible as $N \to \infty$. Thus, fixing a given $\Delta\theta$ and a given
$\delta f_m$, however small, we can always choose N so large that the measure of
those sequences in which $\delta f_i > \delta f_m$ is less than any specified number.
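
The quality of the Gaussian form (7.7) can be judged by comparing it with the exact multinomial count. The sketch below is illustrative only: it works in units where the circle has length 1, so that $\Delta\theta/2\pi$ is one over the number of elements, it drops the common factor $\Delta\theta$, and the function names are ours.

```python
from math import lgamma, log


def log_measure(counts):
    """ln of N!/(N_1! N_2! ...) * (1/n_bins)**N for the given occupation numbers."""
    n_total = sum(counts)
    n_bins = len(counts)
    log_multinomial = lgamma(n_total + 1) - sum(lgamma(c + 1) for c in counts)
    return log_multinomial - n_total * log(n_bins)


def gaussian_log_ratio(counts):
    """Quadratic estimate of ln(M/M_0): minus the sum of (dN_i)**2 / (2 * N_bar)."""
    n_mean = sum(counts) / len(counts)
    return -sum((c - n_mean) ** 2 for c in counts) / (2 * n_mean)


if __name__ == "__main__":
    uniform = [100] * 10
    skewed = [110, 90] + [100] * 8
    exact = log_measure(skewed) - log_measure(uniform)
    print(exact, gaussian_log_ratio(skewed))
```

For deviations that are small compared with $\bar{N}$ the two numbers agree closely, and both show that any appreciable fractional deviation is suppressed by a factor that grows exponentially with N.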

What is the meaning of this result? We first remind ourselves that all of
the possible sets of initial angles that can lead to a given distribution,
$N_i = \rho(N, \theta_0, \theta_i)\,\Delta\theta$, arrange themselves into small elements of width $\Delta\theta/K^N$.
Thus as $N \to \infty$, the width of each element approaches zero very rapidly.
This behavior corresponds to the extreme instability of motion, since a change
of $\Delta\theta_0 = \Delta\theta/K^N$ would change $\theta_N$ by $\Delta\theta$ and therefore alter the distribution.

Thus, to get any given type of sequence, as $N \to \infty$ we would need a
perfect fit for the boundary conditions. But as we have seen, the overwhelming
majority of sequences correspond to distributions that are very nearly uniform.
Only a comparatively small number of regions of width $\Delta\theta/K^N$ will lead to
anything different (such special regions will occur, for example, where $\theta_0$ is
rational). Although the number of such special regions approaches infinity
as $N \to \infty$, this approach is much less rapid than is the decrease in width
of each region, so that the total measure of such special regions approaches
zero, as $N \to \infty$.
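
The special character of rational initial angles is easy to exhibit. In the sketch below, which is only an illustration, angles are measured in units of $2\pi$ and represented exactly as fractions; the values K = 10 and the starting angles 1/3 and 1/7 of a turn are arbitrary choices of ours, as is the function name.

```python
from fractions import Fraction


def distinct_angles(x0, K, steps):
    """Set of distinct angles (as fractions of a turn) visited in `steps` collisions."""
    seen = set()
    x = x0 % 1
    for _ in range(steps + 1):
        seen.add(x)
        x = (K * x) % 1  # theta_{n+1} = K*theta_n, reduced modulo one turn
    return seen


if __name__ == "__main__":
    print(sorted(distinct_angles(Fraction(1, 3), 10, 1000)))
    print(sorted(distinct_angles(Fraction(1, 7), 10, 1000)))
```

Such orbits return to the same few angles however many collisions are followed, so their long-run distribution is concentrated on a finite set of points rather than being uniform.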

With regard to the theory of probability, the essential meaning of the
above result is that the long run average distribution of angles is practically
independent of the initial angle, $\theta_0$. It is true that there is a special set of
measure zero that could give rise to a non-uniform long-run distribution, but
one can show that such special values of $\theta_0$ play no role in real problems. To
see why this is so, we note that in order that one of these special non-uniform
distributions should come about, it would be necessary that the value of $\theta_0$
be defined exactly (i.e., with no error whatever). We know, however, that it
is impossible in reality to have a physical system in which the value of a
continuous variable is defined as exactly equal to any given number. For every
system is always interacting with other systems in the environment which also
undergo irregular fluctuations in the motions of their elements (such as atoms
and molecules). Thus, even if a particle accidentally did have its initial value
of $\theta_0$ exactly equal to one of the special set of measure zero described above,
it would be sure to obtain a slight disturbance from its environment; and
because of the instability of motion, even the slightest disturbance would carry
it far from this special state and lead eventually to a uniform distribution.
Now, it is true that such a disturbance might accidentally throw the system
into another of the special angles, $\theta_0$, of the set of measure zero, not leading
to a uniform distribution. This would require, however, a very special exact
coordination between the external disturbance and the motion of the system.
It is evident that for practically all possible motions, this kind of coordination
would not exist. Thus, it is consistent with the theory to suppose that the
motions are such that there are no such coordinations. Moreover, this
supposition is also consistent with all of our practical experience. For example,
if we consider a cone standing on its point, this position is one of unstable
equilibrium. It would be conceivable that the fluctuations of the surrounding air
and of the table on which it is supported would be just such as to maintain
it in its position of unstable equilibrium for all time, but evidently this never
happens in reality (*).

We conclude then that the treatment of the sequence given here is a special
case of the general statistical problem, in the sense that certain long run or
average properties can be predicted without taking into account the precise
factors needed for a prediction of the details of the behavior of the system.
For as we have seen, once we exclude the special set of $\theta_0$ of measure zero
described above, then the mean distribution of angles is independent of $\theta_0$
and is therefore independent of the boundary conditions that determine the
precise motion. We conclude then that the average value of any integrable
function of $\theta$,

$\overline{f(\theta)} = \lim_{N\to\infty} (1/N) \sum_{n=1}^{N} f(\theta_n)$ ,

will, independently of initial conditions, be equal to

$\dfrac{1}{2\pi}\int_0^{2\pi} f(\theta)\,d\theta$ .

The above is evidently equivalent to a statement of the equidistribution
over angles. Thus, as far as computing averages of any function of $\theta$ is
concerned, we obtain the same result as that which follows from using the
theory of probability, with the probability distribution in $\theta$ set equal to a
constant.
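
The equality of the long-run average along one orbit and the average over angles can be tried on a particular function. The sketch below is an illustration under assumptions of ours: the starting angle is $2\pi$ times the fractional part of $\sqrt{2}$, followed with binary fixed-point integers of sufficient length, the test function is $\cos^2\theta$ (whose average over angles is 1/2), and the names `time_average` and `guard_bits` are hypothetical.

```python
from math import cos, isqrt, pi


def time_average(f, K, n_collisions, guard_bits=64):
    """Average of f(theta_n) over n = 1 ... N along one orbit of theta -> K*theta.

    The angle is stored as an integer x with theta = 2*pi * x / 2**bits, using
    enough bits that the starting value is still resolved after N collisions.
    """
    bits = n_collisions * K.bit_length() + guard_bits
    one = 1 << bits
    x = isqrt(2 << (2 * bits)) - one  # fractional part of sqrt(2), an arbitrary start
    total = 0.0
    for _ in range(n_collisions):
        x = (x * K) % one
        total += f(2 * pi * (x / one))  # x / one is an exact-to-rounding float in [0, 1)
    return total / n_collisions


if __name__ == "__main__":
    print(time_average(lambda theta: cos(theta) ** 2, K=10, n_collisions=2000))
```

The result depends on the starting angle only through fluctuations that shrink as the number of collisions grows, apart from the special starting angles of measure zero discussed above.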

We shall now compare our results with those of POINCARÉ (10), who treated
a similar problem, but by different methods. POINCARÉ realized clearly the

(*) It is interesting to note that COURNOT in his Exposition de la Théorie des Chances
et des Probabilités already realized the importance of this point, and in fact used the
example of the cone standing on its point as an argument in favor of the thesis that
in chance phenomena, no special properties depending on the infinitely precise
specification of any particular quantity could be of physical importance.

(!°) H. Potnearé: Caleul des Probibilités (Paris, 1912). 
importance of instability in leading to an equidistribution. Indeed, he
considered the case of a packet of asteroids, each in neighboring, but slightly
different circular orbits. Because the phase angle of any particular asteroid
relative to that of another is unstable (i.e., it increases indefinitely with
the time), one expects that in time the asteroids will have a uniform and more
or less random distribution of phase angles (which is in fact found in
astronomical observations (*)).

In order to treat this problem, POINCARÉ considered an arbitrary continuous 
initial probability distribution (e.g., one having a very narrow but continuous 
peak near a certain point). He then showed that independently of the precise 
initial form of such a distribution, the probability would eventually become 
a constant that is independent of the angle. For discontinuous initial
distributions (e.g., a delta function) this would not in general occur. But here
POINCARÉ used an argument similar to the one by which we have excluded
sets of measure zero as not representing real physical possibilities. He did
this by pointing out that in all real problems, there is a background of
fluctuations coming from influences outside the scope of what is treated in the
problem (e.g., the motions of other planets, and even the molecular motions)
so that no discontinuous probability function (such as a delta function) can
represent any real physical situation.

POINCARÉ’s conclusion was thus essentially the same as ours. But the
weak point in his argument is that his mathematical methods do not
correspond to the real context of his physical ideas. Indeed, from a strictly
logical point of view, the assumption of a narrowly peaked and continuous
initial distribution is not fundamentally different from assuming the
ultimate uniform probability distribution to begin with. It must thus be
concluded that while POINCARÉ has demonstrated an interesting and useful
relationship between two probability distributions taken at different times,
he has still remained within the basic framework of just calculating one
probability in terms of another.

On the other hand, the significance of our result is that the development 
of an equidistribution can be understood without any reference to the concept 
of probability at all. This we have done by recasting WEYL’s results on the 
development of an equidistribution in a form in which the physical
significance of the sets of measure zero could easily be seen, so that we can
understand why they must be excluded (+).


(*) The rings of Saturn, made of small meteor-like particles, present a similar
problem.

(+) In Section 10, we shall see how an example similar to the one discussed by
POINCARÉ can be treated without the use of probability.
Finally, we shall consider the possibility of proving equi-distribution in
terms of laws intermediate between determinate and probabilistic types, dis- 
cussed in the previous section, in connection with statistical mechanical prob- 
lems. It is important to consider such problems, because the notion of a single 
mechanical variable, following a determinate law, defined with unlimited pre- 
cision, is evidently an abstraction. In reality, every such mechanical variable 
is subject to fluctuating perturbations of the type that we have mentioned 
in the previous section and in the above paragraph. In other words, it is in 
reality coupled, even if only slightly, to an infinity of other parameters. The 
extreme instability of motion could conceivably make such coupling significant 
over a long time. It is important therefore to show that there exists a wide 
range of cases in which the equidistribution would not be significantly affected 
by such perturbations. 

In order to do this, we introduce the notion of the sequence space. The
coordinates of a point in this space are specified by the values
$(\varphi_0, \varphi_1, \ldots, \varphi_N) = [\varphi_n]$ of the various angles
in any specified sequence that we wish to consider. Moreover, once we have
chosen $\varphi_0$, then any point in the subspace
$(\varphi_1, \ldots, \varphi_N) = [\varphi_n; \varphi_0]$, which starts at
$\varphi = \varphi_0$, corresponds to a particular determinate relationship
defining every term in the sequence in terms of $\varphi_0$. (The relationship
$\varphi_n = K^n \varphi_0$ is evidently a special case.) By considering a
range, $d\varphi_1 \ldots d\varphi_N$, of points corresponding to a given
$\varphi_0$, we are effectively considering a range of possible relationships
to determine the sequence.

Now, as we did in our discussion of statistical mechanics in Sec. 5, we can
define the range of sequence space corresponding to a given distribution of the
angles, $\theta$ (instead of the range of phase space considered in Sec. 5). We
need not go through the mathematical details here, as they are very similar to
what was done in the statistical mechanical problem; we shall merely quote
the result.

An equidistribution of the angles, $\theta$, corresponds to the maximum possible
volume of sequence space (*), distributions near the uniform one corresponding 
to a volume of sequence space that decreases according to a Gaussian curve. 
When N is large, this decrease is so rapid that almost all of sequence space 
corresponds to a uniform distribution. 

The above result has a number of interesting consequences. First of all,
we see that the special law ($\varphi_n = K^n \varphi_0$) that we are discussing
is not essential to an equidistribution, so that there is a wide range of
determinate laws

(*) In the statistical mechanical problem, the Maxwellian distribution, instead of 
the uniform one, was obtained as the one corresponding to maximum volume in phase 
space, because of the requirements of conservation of energy. No analogous requi- 
rement appears in the present problem. 


that could lead to the same statistical results. Indeed, even a law intermediate
between determinate and probabilistic could have the same statistical effects, 
provided that as N increases without limit, it is quite rare that the point in 
sequence space finds itself inside the minute volume corresponding to distri- 
butions deviating appreciably from a uniform one. 

The above result must, however, be interpreted with care. It does not by 
any means signify, for example, that all distributions must be uniform. Laws 
are evidently quite possible that would lead to non-uniform distributions, or 
even to distributions that approached no particular limiting behavior at all. 
All that we have shown is that a wide range of laws exist that could lead to 
a uniform distribution, so that the appearance of such a distribution need 
not be sensitively dependent on having a certain special kind of law determining 
the sequence. Moreover, we can also understand why a wide range of per- 
turbations of the system need not disturb the uniform character of the distri- 
bution significantly. 


8. — The General Applicability of the Calculus of Probability. 


Thus far, we have considered only the development of equidistribution in 
the mean behavior of the angle, $\theta_n$. The calculus of probability deals, however,
with properties that are more general than just the mean behavior of a simple 
event. It deals also with the mean behavior of various combinations of events. 
Thus, in the case of throws of a die, the theory of probability permits us to 
estimate the relative frequency with which we shall obtain two sixes in a row, 
by multiplying the probabilities for each result separately, according to the 
well known rules that apply for combinations of « independent » events. More- 
over, the theory of probability also applies to problems in which the proba- 
bility distribution is not uniform. 

In this section, we shall show in terms of the same sequence treated in 
Sec. 7 ($\theta_n = K^n \theta_0$) how a determinate law can lead to the validity of the rules
for combining probabilities. In the course of the treatment of the above 
problem, we shall also show how all of our results can be generalized to cases 
in which the probability distribution is not uniform. 

The first problem that we shall consider is that of estimating the mean
number of times in which a given angle $\theta_{i_1}$ is followed by another
specified value, $\theta_{i_2}$. More complex combinations involving three,
four or more angles may evidently also be of interest.

To treat this problem, it is convenient to introduce as an intermediate
concept the notion of conditional probability. Thus, in our problem, a typical
conditional probability is $p_{i_1 i_2}\, d\theta_{i_1}\, d\theta_{i_2}$, the
probability that if the angle after a given collision lies between
$\theta_{i_1}$ and $\theta_{i_1} + d\theta_{i_1}$, the angle after the next
collision will lie between $\theta_{i_2}$ and $\theta_{i_2} + d\theta_{i_2}$.
It is evident that the probability of the compound event $i_1 i_2$ is then

(8.1)    $$dP_{i_1 i_2} = p_{i_1}\, p_{i_1 i_2}\, d\theta_{i_1}\, d\theta_{i_2} .$$


A special case of considerable interest arises when $p_{i_1 i_2}$ is
independent of $i_1$. In other words, the probability distribution for the
second angle does not depend on the first angle. It is easy to see that in this
case $p_{i_1 i_2} = p_{i_2}$. Thus the probability of the compound event becomes

(8.2)    $$dP_{i_1 i_2} = p_{i_1}\, p_{i_2}\, d\theta_{i_1}\, d\theta_{i_2} ,$$


and we obtain the well-known product rule, which applies for « independent »
events. This rule implies that there is no correlation between successive
angles. For if there were correlations, the probability of obtaining
$\theta_{i_2}$ would depend on which angle came before.

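We may also be interested in the conditional probability that if the angle
after a given collision lies between $\theta_{i_1}$ and
$\theta_{i_1} + d\theta_{i_1}$, the angle after $\alpha_1$ additional collisions
will be between $\theta_{i_2}$ and $\theta_{i_2} + d\theta_{i_2}$. This we
denote by

$$p^{\alpha_1}_{i_1 i_2}\, d\theta_{i_1}\, d\theta_{i_2} .$$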


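Moreover, we may be interested in combinations involving more events.
Thus, we wish to define the probability that if after a given collision the
angle is between $\theta_{i_1}$ and $\theta_{i_1} + d\theta_{i_1}$, and after
$\alpha_1$ additional collisions between $\theta_{i_2}$ and
$\theta_{i_2} + d\theta_{i_2}$, the angle after still another $\alpha_2$
collisions will be between $\theta_{i_3}$ and $\theta_{i_3} + d\theta_{i_3}$.
It is evident that arbitrarily high order conditional probabilities can be
defined in this way. The probability of a compound event
$(i_1 i_2 \ldots i_K)$ is then

(8.3)    $$dP^{\alpha_1 \ldots \alpha_{K-1}}_{i_1 \ldots i_K} = p_{i_1}\, p^{\alpha_1}_{i_1 i_2}\, p^{\alpha_1 \alpha_2}_{i_1 i_2 i_3} \cdots p^{\alpha_1 \ldots \alpha_{K-1}}_{i_1 \ldots i_K}\, d\theta_{i_1} \cdots d\theta_{i_K} .$$

A case of great interest is one in which $p^{\alpha_1}_{i_1 i_2}$ is
independent of $i_1$, $p^{\alpha_1 \alpha_2}_{i_1 i_2 i_3}$ independent of
$i_1$, $i_2$, etc. It is readily verified that for this case

(8.4)    $$p^{\alpha_1}_{i_1 i_2} = p_{i_2}, \quad \ldots, \quad p^{\alpha_1 \ldots \alpha_{K-1}}_{i_1 \ldots i_K} = p_{i_K}$$

and

(8.5)    $$dP_{i_1 \ldots i_K} = p_{i_1}\, p_{i_2} \cdots p_{i_K}\, d\theta_{i_1} \cdots d\theta_{i_K} .$$

This is the case of complete absence of correlation up to order K, or of
complete « statistical independence ».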
We shall now show that with our sequence, $\theta_n = K^n \theta_0$, the
conclusions drawn from using probabilities of compound events are valid. To do
this, we must deal with two questions. First, do quantities that would
correspond to the probabilities of compound events exist (i.e., do the
associated relative frequencies approach limits independent of $\theta_0$)?
Secondly, if so, what are the values of these quantities?

In order to study this problem, we generalize the statement of
equidistribution given in eq. (7.8). First of all, eq. (7.8) applies only for
uniform probabilities. One could however define a more general probability,
$P(\theta)$, such that

(8.6)    $$\int_0^{2\pi} P(\theta)\, f(\theta)\, d\theta = \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} f(\theta_n) ,$$



which would hold even if the distribution were not uniform. In order to prove
that a probability, $P(\theta)$, really exists for a given type of sequence of
angles $\theta_n$, it would be necessary to show that it is possible to find a
function such that eq. (8.6) holds independently of $\theta_0$ except possibly
for a set of measure zero. In Sec. 7 we have already done this for the
sequence $\theta_n = K^n \theta_0$, in which case it turns out that $P(\theta)$
is a constant. Evidently, however, a more general problem can be attacked in a
similar way. It is not difficult to find a simple determinate sequence leading
to a non-uniform $P(\theta)$. In this paper, however, we restrict ourselves to
merely pointing out that the mathematical framework is already broad enough to
permit the treatment of problems involving non-uniform probabilities.

The probabilities of compound events can be treated similarly. To do
this, we consider any integrable function of two variables
$f(\theta_1, \theta_2)$. Let us recall that in our sequence both $\theta_n$ and
$\theta_{n+\alpha_1}$ are determined in terms of $\theta_0$. Thus, once
$\theta_n$ is given, $\theta_{n+\alpha_1}$ is certainly determined. In general,
such a determination would lead to correlations (e.g., $\theta_{n+\alpha_1}$
could be equal to $\theta_n$). We shall see, however, that in our problem, the
relation between $\theta_{n+\alpha_1}$ and $\theta_n$ fluctuates sufficiently
as n changes so that no such correlations exist.

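If a probability $P_{\alpha_1}(\theta_1, \theta_2)$ of the compound event
exists, then we must have

(8.7)    $$\lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} f(\theta_n, \theta_{n+\alpha_1}) = \frac{\iint P_{\alpha_1}(\theta_1, \theta_2)\, f(\theta_1, \theta_2)\, d\theta_1\, d\theta_2}{\iint P_{\alpha_1}(\theta_1, \theta_2)\, d\theta_1\, d\theta_2} .$$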


Assuming that a suitable function $P_{\alpha_1}(\theta_1, \theta_2)$ exists,
then evidently $P_{\alpha_1}(\theta_1, \theta_2)$ can be interpreted as the
probability of obtaining an angle between $\theta_1$ and
$\theta_1 + d\theta_1$ in the n-th step followed by an angle between $\theta_2$
and $\theta_2 + d\theta_2$ in the $(n + \alpha_1)$-th step.
For the special case in which
$P_{\alpha_1}(\theta_1, \theta_2) = P(\theta_1) P(\theta_2)$ there is then no
correlation between $\theta_n$ and $\theta_{n+\alpha_1}$, while a still further
simplification is obtained when $P(\theta_1)$ and $P(\theta_2)$ are both
constant.

To apply eq. (8.7) to our sequence, we shall restrict ourselves to
$f(\theta_n, \theta_{n+\alpha_1})$ that can be approximated adequately in terms
of a finite but arbitrarily large number of terms in a Fourier series

(8.8)    $$f(\theta_n, \theta_{n+\alpha_1}) = \sum_{l,m} G_{lm} \exp\left[i\left(l\theta_n + m\theta_{n+\alpha_1}\right)\right] .$$


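As we shall see later such a function is general enough for most applications.
We then have (using $\theta_{n+\alpha_1} = K^{\alpha_1} \theta_n$)

(8.9)    $$\frac{1}{N} \sum_{n=1}^{N} f(\theta_n, \theta_{n+\alpha_1}) = \frac{1}{N} \sum_{l,m} G_{lm} \sum_{n=1}^{N} \exp\left[i\theta_n\left(l + mK^{\alpha_1}\right)\right] .$$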


We then write $\theta_n = K^n \theta_0$ and $(l + mK^{\alpha_1}) = C_{lm}$
(provided that $C_{lm} \neq 0$). Then replacing $\theta_0$ by
$\theta_{lm} = C_{lm}\theta_0$ we get for the sum in eq. (8.9)

$$\sum_{l,m} G_{lm}\, \frac{1}{N} \sum_{n=1}^{N} \exp\left[iK^n \theta_{lm}\right] .$$

Since the distribution of angles, $\theta_n = K^n \theta_{lm}$, is uniform,
each term in the above double series will vanish (except for a set of
$\theta_{lm}$ of measure zero).

This conclusion, however, does not apply for $\theta_{lm} = 0$, and since K was
assumed to be an integer in our original proof of the equidistribution, terms
with $C_{lm} = 0$ can always appear. We note however that WEYL has shown
that the above sum is zero except for a set of measure zero even when K is
not rational. To simplify the problem we shall therefore suppose from now
on that K is irrational, so that $C_{lm}$ is never zero, unless l = m = 0. We
therefore conclude that (except for a set of $\theta_0$ of measure zero)

(8.10)    $$\lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} f(\theta_n, \theta_{n+\alpha_1}) = G_{00} .$$
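
The role of the terms with $l + mK^{\alpha_1} = 0$ can be checked directly in a small Python sketch. It deliberately uses an integer K (here the assumed value K = 2, with $\alpha_1 = 1$ and the assumed starting value $\theta_0 = 2\pi(\sqrt{2} - 1)$), which is exactly the case that the supposition of irrational K is meant to avoid. The function names are hypothetical, and exact rational arithmetic is used so that the sequence is followed without rounding error.

```python
import cmath
import math
from fractions import Fraction


def orbit(x0, K, n_steps):
    """Exact terms x_n = frac(K**n * x0), where theta_n = 2*pi*x_n."""
    xs = [x0]
    for _ in range(n_steps - 1):
        xs.append((K * xs[-1]) % 1)
    return xs


def pair_term_average(xs, l, m, alpha):
    """(1/N) * sum over n of exp[i(l*theta_n + m*theta_{n+alpha})]."""
    n_terms = len(xs) - alpha
    total = 0j
    for n in range(n_terms):
        turns = (l * xs[n] + m * xs[n + alpha]) % 1  # exact phase, in turns
        total += cmath.exp(2j * math.pi * float(turns))
    return total / n_terms


BITS = 2100
# x0 = sqrt(2) - 1 truncated to BITS binary digits (assumed starting value)
x0 = Fraction(math.isqrt(2 << (2 * BITS)) - (1 << BITS), 1 << BITS)
xs = orbit(x0, K=2, n_steps=2000)

resonant = pair_term_average(xs, l=2, m=-1, alpha=1)  # l + m*K = 0
generic = pair_term_average(xs, l=1, m=1, alpha=1)    # l + m*K = 3
print(abs(resonant), abs(generic))
```

For l = 2, m = -1 the phase $2\theta_n - \theta_{n+1}$ is always a whole number of turns, so that term averages to exactly 1 instead of vanishing, while the term with l = m = 1 averages to a value near zero.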


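But if we assume a probability, $P(\theta_1, \theta_2) = $ constant, then in
eq. (8.7) we have

(8.11)    $$\frac{1}{(2\pi)^2} \int_0^{2\pi}\!\!\int_0^{2\pi} f(\theta_1, \theta_2)\, d\theta_1\, d\theta_2 = G_{00} .$$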


Thus, we have proved that independently of $\theta_0$, the averages of
arbitrary functions $f(\theta_1, \theta_2)$ that are expansible in a finite
number of terms in a Fourier series are the same as those obtained from a
probability $P(\theta_1, \theta_2) = $ constant. Hence, we have also proved
that there is no correlation between $\theta_n$ and $\theta_{n+\alpha_1}$.

It is evident that these ideas can be applied to the case of the probabilities
of K-fold combinations of events, and a simple extension of the above results
shows that

(8.12)    $$P_{\alpha_1 \ldots \alpha_{K-1}}(\theta_1, \ldots, \theta_K) = \text{constant},$$

so that the K-fold correlations vanish, where K is arbitrarily large.

At this point, it should be noted that the set of values of $\theta_0$ for
which the distribution of the simple events, $\varrho(\theta)$, is not uniform
will not in general coincide with the sets $\theta_{lm}$ for which the various
distributions of compound events fail to be uniform. Thus, if $\theta_0$ is one
of the values for which $\varrho(\theta)$ is not uniform, then as we have seen,
$\theta_{lm} = C_{lm}\theta_0 = (l + mK^{\alpha_1})\theta_0$ will be a value for
which the distribution of twofold events corresponding to a separation of
$\alpha_1$ fails to be uniform. Hence, the more kinds of distributions that we
take into account and the more terms that we take in the Fourier series, the
more values of $\theta_0$ there will be for which some distribution fails to be
uniform. Of course, as long as the number of distributions and the number of
terms in the Fourier series is finite, we can eventually obtain a number of
elements of the sequence as a whole, N, so large that the total measure of all
these special angles is as small as we please. Nevertheless, the more different
kinds of distributions and the more terms that are used in the various Fourier
series, the larger N will have to be in order to make the measure of this
special set less than any number, $\varepsilon$, which represents the precision
that is significant in the conditions of the problem of interest.

In order to bring out more clearly what we mean by the notion of significant
precision, let us first note that in each problem there will in general
exist some range of angles, $\Delta\theta$, such that no important properties
within the context of interest will depend on a definition of the angle with a
precision greater than $\Delta\theta$. We then introduce the so-called
characteristic function, $\chi$, of the set of angles of interest, which is
zero outside the set and equal to unity inside the set. The mean value of
$\chi$ is then equal to the probability that the system will be within the
range $\Delta\theta$. Now, to approximate a function such as $\chi$, we need a
Fourier series having a number of terms of the order of $1/\Delta\theta$. Thus,
we see how the number of terms in the various series is determined.

In order to show what determines the number of distributions of compound
events that needs to be taken, we first point out that in any particular
problem, only a finite number of combinations of events will in general be
important. For example, in an effort to « beat » a game of coin throws, a
player might choose a « system » of plays, such that if he obtains a head, and
after $\alpha_1$ throws, another head, he will then bet on heads for the play
coming $\alpha_2$ throws later still. The vanishing of three-fold correlations
is enough to guarantee that this system will not « beat » the game. More
general systems could be invented such that he would bet only when the results
appeared in some other specified but predetermined way involving any desired
number of throws, and as long as the corresponding combination of events has a
uniform distribution, he cannot win systematically by any such choice. A
different example which is more complex, but which illustrates the same
principle, comes from statistical mechanics. Thus, the total potential energy
of the molecules of a gas is $H = \tfrac{1}{2} \sum_{i,j} V(x_i - x_j)$, where
$V(x_i - x_j)$ is the potential energy of interaction of a pair of molecules.
The mean potential energy would then evidently depend on the two-fold
correlations in the position distribution of the molecules.

From the above example, we see that the combinations of events that are 
important may depend partly on what is of interest to us, and partly on the 
physical conditions of the problem.

When the distribution of arbitrary combinations of compound events is
uniform, so that the distribution possesses the property of statistical
independence (as is true in our sequence of angles), then one can, by the well
known techniques of the theory of probability, deduce a number of important
results. For example, let us divide the basic circle into two equal parts,
denoting by A a case in which the angle is in the first part, and by B a case
in which it is in the second part. This division provides us with a model of a
process leading to results analogous to those in a game of coin throws.
Evidently, in the long run, the angles will be uniformly distributed over A
and B (except for a set of $\theta_0$ of measure zero). Moreover, a simple
calculation shows that various combinations of events (e.g., combinations in
which the n-th event is separated from the next by $\alpha_1$ steps) also have
a uniform distribution. Thus we reproduce another property of the game of coin
throws. (For example, the probability that a head will be followed $\alpha_1$
steps later by a head is the same as that it will be followed by a tail.)
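
The coin-throw model can be tried out numerically. The sketch below is only an illustration under stated assumptions: it takes K = 2 and $\theta_0 = 2\pi(\sqrt{2} - 1)$, for which the label A or B of the n-th angle is simply a binary digit of $\sqrt{2} - 1$ and can be obtained exactly from an integer square root. The function names and the separation of three steps are hypothetical choices.

```python
import math


def coin_sequence(num_throws):
    """A/B labels for theta_n = 2**n * theta_0 with theta_0 = 2*pi*(sqrt(2) - 1).

    theta_n lies in the first half-circle (A) exactly when the (n+1)-th binary
    digit of sqrt(2) - 1 is 0, so the labels are read off an exact integer
    square root.  K = 2 and this theta_0 are assumed, illustrative choices.
    """
    root = math.isqrt(2 << (2 * num_throws))  # floor(sqrt(2) * 2**num_throws)
    digits = root - (1 << num_throws)         # binary digits after the point
    return ["A" if (digits >> (num_throws - 1 - n)) & 1 == 0 else "B"
            for n in range(num_throws)]


def pair_frequencies(labels, alpha):
    """Relative frequency of each (label_n, label_{n+alpha}) combination."""
    counts = {"AA": 0, "AB": 0, "BA": 0, "BB": 0}
    n_pairs = len(labels) - alpha
    for n in range(n_pairs):
        counts[labels[n] + labels[n + alpha]] += 1
    return {pair: count / n_pairs for pair, count in counts.items()}


throws = coin_sequence(4000)
print("frequency of A:", throws.count("A") / len(throws))
print("pairs three steps apart:", pair_frequencies(throws, alpha=3))
```

In a run of this length the frequency of A should lie close to 1/2 and each of the four pair frequencies close to 1/4, as expected for independent throws of a fair coin.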

It is clear that we can now calculate the probabilities of compound events
in the same way as is done with the theory of probability. Thus the
probability of a combination of n events in which the angle is found $n_A$
times in A and $n_B$ times in B (with $n = n_A + n_B$) is just
$W = (\tfrac{1}{2})^n\, n!/(n_A!\, n_B!)$, the well-known binomial
distribution. If n, $n_A$ and $n_B$ are large, one may obtain the gaussian
approximation $W(n, \delta f) \sim \exp[-(n/2)(\delta f)^2]$, where
$\delta f = \delta n/n$ and $\delta n$ is the deviation of $n_A/n$ from its
probability (which is $\tfrac{1}{2}$).

This result means that if we followed the trajectory for any $\theta_0$
(except for a set of measure zero), we would in the long run obtain a gaussian
distribution in $\delta n/n$. Thus, we see how the gaussian distribution, which
is so typical of the theory of probability, can come out of a determinate
sequence (*).

A further step is to deduce Bernoulli’s theorem. For as n becomes larger
and larger it is evident that one will eventually obtain a negligible number
of cases in which $\delta f$ is greater than some specified number,
$\varepsilon$. This means that for a typical trajectory, practically all
choices of a sample of n terms will lead to $n_A/n$ very close to
$\tfrac{1}{2}$.
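
The binomial weights and the trend expressed by Bernoulli's theorem can be tabulated directly. The following Python sketch is a minimal illustration: the function names, the tolerance of 0.05 and the sample sizes are assumptions chosen for the example. It evaluates $W = (\tfrac{1}{2})^n\, n!/(n_A!\, n_B!)$ exactly and sums the weights of all samples whose fraction $n_A/n$ lies within the tolerance of 1/2.

```python
import math


def binomial_weight(n, n_a):
    """W = (1/2)**n * n! / (n_a! * (n - n_a)!) for n_a cases of A in n events."""
    return math.comb(n, n_a) / 2 ** n


def mass_near_half(n, eps):
    """Total weight of samples whose fraction n_a / n is within eps of 1/2."""
    return sum(binomial_weight(n, k)
               for k in range(n + 1)
               if abs(k / n - 0.5) <= eps)


for n in (100, 1000):
    print(n, mass_near_half(n, eps=0.05))
```

The weight concentrated near 1/2 grows with n, so that samples deviating by more than the chosen tolerance become progressively rarer.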

Finally, let us note that these results can be generalized, by showing that 
they also follow from types of laws intermediate between determinate and pro- 
babilistic. This can be done by methods analogous to those appearing in 
Secs. 6 and 7, but we shall not carry out such a demonstration in detail here. 


9. — A Criterion for Randomness. 


It can be seen that in the previous section we have justified the calculus 
of probabilities, as a means of treating the statistical properties developed in 
the long run by our sequence, $\theta_n = K^n \theta_0$. In doing this, we have also given
the elements with which one can obtain a criterion for randomness. 

It is our conclusion that the property of statistical independence (i.e.,
absence of correlations) furnishes a natural and useful criterion for randomness.
At this point, we must emphasize, however, that statistical independence 
with regard to all possible combinations of events will occur only in rather 
special problems (such as perhaps in the games of coin throws and dice). More 
generally, we may expect approximate and partial statistical independence. 
The degree of randomness may then be approximate, in the sense that the 
K-fold correlations vanish only in some degree of approximation, and not 
exactly. It may be partial in that some of the K-fold correlations vanish, 
but not all of them. As we have pointed out in Sec. 8, the important problem 
is then, whether the kinds of correlations that have physical significance in 
a given problem, and those that are of interest to us in certain applications, 
will be small enough to be neglected. If this is so, we may say that the distri- 
bution is «random in the context under discussion ». 

As a justification for our proposed criterion for randomness, in terms of
absence of correlations, we first point out that this is effectively the practical 
criterion most frequently used in applications in physics, (e.g., statistical 
mechanics) (+) as well as in other applications of statistics (e.g., problems 


(*) The gaussian approximation has already been obtained from determinate
sequences in other ways. See, for example, M. KAC and H. STEINHAUS: Studia
Mathematica, 6, 59 (1936).

(+) In Section 10, we shall give an example of how this definition of randomness
is used in statistical mechanics, in connection with the development of « random phases ».
of distributions of various kinds of members in a certain population). Secondly,
we point out that it contains what is really essential in the definition given by 
von MISES (*). For if a sequence has the property of statistical independence 
of its members, then as we have pointed out in Sec. 8, the gaussian law follows,
so that most samples will tend to give good representations to the relative 
frequency distribution in the sequence as a whole. 

The criterion that we have given for randomness avoids the problems
appearing in the definition given by von MISES. First of all, we do not define
randomness in terms of irregularity or lawlessness and thus, we do not become
involved in the very probably unsolvable problem of defining what irregularity
and lawlessness mean. Instead, we regard randomness as a special
manifestation of law, a manifestation which can appear however only in the
context of a statistical aggregate and one of whose characteristic aspects is
statistical independence. In this regard, we adopt a position similar to that
of WALD (*) and various other members (*) of the school of von MISES, who adopt
as a criterion for randomness a certain class of laws suggested by the
impossibility of « beating » a gambling game. Our criterion differs, however,
from theirs in two aspects: (A) It is much easier to apply in practice in a
wide range of statistical problems, and indeed corresponds to just the
practical definition that is most frequently used. (B) The criteria for
randomness are not chosen completely a priori or by the simple transfer to all
possible problems of the criteria that may be significant in a given problem
(e.g., gambling games). Instead, the criteria are determined either by the
actual physical situation or by the statistical problems that we wish to
treat, because the kinds of correlations that are significant may depend on
the context in which the theory is applied.

With regard to the question of irregularity or lawlessness, it appears to
us that this is not an essential feature of the problem. For example, in the
case of throws of a coin, regular orders, such as that of ten heads in a row,
will certainly be quite rare. But any other specified order, however irregular
it may seem to us, will be equally rare. Thus, the rarity of « regularity » is
not so much a property of the sequence itself as of our own tendency to pick
out a few special orders and call them « regular », while the overwhelming
majority of orders are relegated to the very broad category of « irregular ».
Hence any distribution that fluctuates over a wide range of possible orders
will inevitably seem irregular to us most of the time.

Before ending this section, we shall make a few remarks on the
relationship between randomness and chance. Now, as we have seen, randomness


(*) See Section 4. 
(+) See Section 4. 


is a statistical property of a given system. It may evidently come from the
effects of external disturbances that already have a degree of randomness in
them, and it may also arise from purely internal causes (as happened with our
sequence $\theta_n = K^n \theta_0$); but most generally it arises because of some combination
of these two kinds of causes. The assumption that statistical theories must 
always be restricted to the calculation of one kind of probability in terms of 
another is then equivalent to neglecting the possibility that a part of the 
randomness can arise internally in a system, without the action of some
external causes that already have in them some degree of randomness. 

On the other hand, as we saw in Sec. 2, chance refers to a relationship in 
which the events in a given context depend contingently on « independent » 
fluctuating causal factors lying outside the context of interest. Thus, while 
a random distribution can develop of its own accord in a given context, chance 
can by its very definition refer only to the effects of causes that lie outside 
the context under discussion. 

There is, nevertheless, a close relationship between randomness and chance. 
For in those cases of chance fluctuations that are most typical (e.g., coin 
throws), the external contingencies (e.g., the motions of the hand that throws 
the coin) on which the results depend, have a range of fluctuation within which 
the distribution is effectively random. Such cases correspond to the category
of « pure chance » discussed in Sec. 2. There may, however, exist a more 
general kind of dependence in fluctuating causal factors (*) that are not enti- 
rely random in their distribution. This dependence still leads to a kind of 
chance, but it is not pure chance. In such a case, the calculus of probabilities 
evidently would be at best an approximation. Here, then, we have an addi- 
tional example of the general statistical problem; for it is evidently of interest 
in such cases to study the question of how good is the degree of approximation
provided by the theory of probability, as well as perhaps to seek a more general
calculus that might apply even when we do not have pure chance.


10. — Further Illustration in Terms of Random Phases.


In this section, we shall apply some of the ideas suggested in the previous 
sections to a problem that has widespread application in statistical mechanics ; 
namely that of the development of random phases in a system of oscillators 
or rotators. 

The simplest example is the problem of random phases with a system of 
harmonic oscillators. Let $x_i$ be the coordinate of a given oscillator, $y_i$ the


(*) This possibility has also been discussed in Section 2. 
corresponding canonically conjugate momentum. The equations of motion of
such a set of oscillators (which are, in general, coupled) can be derived from
the hamiltonian

(10.1)    $$H = \tfrac{1}{2} \sum_{i,j} \left(a_{ij}\, y_i y_j + b_{ij}\, x_i y_j + c_{ij}\, x_i x_j\right) .$$

As is well known, there exists a linear transformation to the normal
coordinates (11), $q_j$, and to the conjugate momenta, $p_j$, such that the
hamiltonian reduces to a sum of squares, each equivalent to a separate harmonic
oscillator, not coupled to the others.


(10.2)    $$H = \tfrac{1}{2} \sum_j \left(A_j p_j^2 + B_j q_j^2\right) .$$

The equations of motion ($\dot q_j = \partial H/\partial p_j$,
$\dot p_j = -\partial H/\partial q_j$) then lead to

(10.3)    $$\ddot q_j + (B_j/A_j)\, q_j = 0, \qquad \ddot p_j + (B_j/A_j)\, p_j = 0,$$

so that each oscillator has angular frequency $\omega_j = (B_j/A_j)^{1/2}$. The
solutions can then conveniently be expressed in the complex form

(10.4)    $$q_j + i p_j = K_j \exp[i\omega_j t],$$

where $K_j$ is an arbitrary complex number, which can be written as

(10.5)    $$K_j = (J_j)^{1/2} \exp[i\varphi_{0j}],$$

so that

(10.6)    $$q_j + i p_j = (J_j)^{1/2} \exp[i(\omega_j t + \varphi_{0j})] .$$

Evidently $J_j$ is proportional to the energy of the j-th oscillator and
$\varphi_{0j}$ is a constant of the motion corresponding to the initial value
of the phase angle of the j-th oscillator.

As a function of the time, the phase angle is then given by

(10.7)    $$\varphi_j = \varphi_{0j} + \omega_j t .$$

As we shall show in this section, if the $\omega_j$ are incommensurable (as
they usually are in most applications) the phases $\varphi_j$ will in time
become random relative to each other.


The problem of random phases arises in a great many important problems, 


(11) H. G. GOLDSTEIN: Classical Mechanics (1952), Chap. 10.
in statistical mechanics, e.g., thermal vibration of a solid, the thermal
excitation of rotating molecules, motions which can be solved in terms of angle
and action variables and, most important of all, in the foundations of quantum
statistical mechanics (12). The application of the notion of random phases
in the foundations of quantum statistical mechanics we shall discuss in more
detail at the end of this section.

In order to illustrate in a simple way just how the phases become random (*), 
let us begin by considering the case of two variables. Now, in the problem 
that we are treating here, we have the new feature, not present in the treatment 
given in sections 7 and 8, that instead of a sequence of discrete values, we 
have to deal with a continuous function of the time t. For this reason, we shall 
replace the sum over the N terms of the sequence appearing in eq. (8.7) by 
an integral over the time, representing the mean value of an arbitrary function, 
$f(\varphi_1(t), \varphi_2(t))$. (The use of the continuous time average in this case corresponds
to the actual practice in the definition of physical quantities in statistical
mechanics (13)). We then obtain as a criterion for a uniform distribution that
the following relationship must hold independently of initial conditions (except 
possibly for a set of measure zero): 


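(10.8) $\lim_{T\to\infty} \frac{1}{T}\int_0^T f(\varphi_1(t), \varphi_2(t))\,dt = \frac{1}{4\pi^2}\int_0^{2\pi}\!\int_0^{2\pi} f(\varphi_1, \varphi_2)\,d\varphi_1\,d\varphi_2$.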


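The limits on $\varphi_1$, $\varphi_2$ are taken from zero to $2\pi$, because as can be seen from
(10.6), the system comes to the same physical state when $\varphi_1$ or $\varphi_2$ change by $2\pi$.
This means also that all physically significant functions of $\varphi_1$ and $\varphi_2$ can be
expressed as periodic functions with period $2\pi$ or that

(10.9) $f(\varphi_1, \varphi_2) = \sum_{n_i, n_j} f_{n_i n_j} \exp[i(n_i\varphi_1 + n_j\varphi_2)]$,

$n_i$ and $n_j$ being integers.
Now, writing $\varphi_1 = \varphi_{01} + \omega_1 t$, $\varphi_2 = \varphi_{02} + \omega_2 t$ we obtain

(10.10) $f(\varphi_1, \varphi_2) = \sum_{n_i, n_j} f_{n_i n_j} \exp[i(n_i\varphi_{01} + n_j\varphi_{02}) + i(n_i\omega_1 + n_j\omega_2)t]$.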


(12) See, for example, R. C. Tolman: The Principles of Statistical Mechanics (Oxford, 1950), Chap. IX.

(*) The development of random phases is already well known (see, for example,
reference (°)). However, we shall give a brief sketch of a proof here which is convenient
for illustrating the concepts involved.

(13) See, for example, reference (5).

Inserting (10.10) in (10.8), we note that if $\omega_1$ and $\omega_2$ are not commensurable every term on the left hand side except $f_{00}$ will integrate out to zero as the time $T$ approaches infinity. On the right hand side, we see from (10.9) that likewise only $f_{00}$ will survive integration over $\varphi_1$ and $\varphi_2$. Thus we demonstrate that the criterion for equidistribution is satisfied independently of initial conditions (*) (note that in this problem, there is no exception at all, not even of a set of measure zero). We conclude then that there exists a uniform probability density that the phase point will occupy any specified region of the configuration space of $\varphi_1$, $\varphi_2$.
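The criterion (10.8) lends itself to a simple numerical check. The following Python sketch is an illustration only and is not part of the original argument: the test function `f`, the frequencies, the initial phases and the averaging time are arbitrary choices made here, and the angle average is taken with the normalization $1/4\pi^2$, so that it equals the constant Fourier term $f_{00}$.

```python
import math

def f(phi1, phi2):
    # Hypothetical periodic test function; its constant Fourier term f_00 is 0.5.
    return 0.5 + math.cos(phi1) * math.sin(2.0 * phi2) + 0.3 * math.cos(phi1 - phi2)

def time_average(func, omega1, omega2, phi01, phi02, T, steps):
    """Midpoint-rule estimate of (1/T) * integral_0^T func(phi1(t), phi2(t)) dt."""
    dt = T / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        total += func(phi01 + omega1 * t, phi02 + omega2 * t)
    return total / steps

def angle_average(func, grid):
    """Uniform-grid estimate of (1/4 pi^2) * double integral of func over [0, 2 pi)^2."""
    h = 2.0 * math.pi / grid
    total = 0.0
    for i in range(grid):
        for j in range(grid):
            total += func(i * h, j * h)
    return total / (grid * grid)

# Incommensurable frequencies (ratio sqrt(2)) and arbitrary initial phases.
lhs = time_average(f, 1.0, math.sqrt(2.0), 0.3, 1.7, T=5000.0, steps=500_000)
rhs = angle_average(f, 64)
print(f"time average  = {lhs:.4f}")
print(f"angle average = {rhs:.4f}")
```

With a commensurable pair such as `omega1 = omega2`, the term in $\cos(\varphi_1 - \varphi_2)$ stays constant along the trajectory and the two averages no longer agree in general.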

For a more general case, in which there does not exist a uniform distri- 
bution, eq. (10.8) should be replaced by 


(10.11) $\lim_{T\to\infty} \frac{1}{T}\int_0^T f(\varphi_1(t), \varphi_2(t))\,dt = \int_0^{2\pi}\!\int_0^{2\pi} P(\varphi_1, \varphi_2)\,f(\varphi_1, \varphi_2)\,d\varphi_1\,d\varphi_2$,

where $P(\varphi_1, \varphi_2)$ is the probability density in the configuration space of $\varphi_1$, $\varphi_2$
and where the equation must hold independently of initial conditions (except
for a possible set of measure zero), if the motion is to be such as to permit the
definition of a probability.

To show in more detail how the equidistribution comes about in the case that we have treated, let us consider the $\varphi_1$, $\varphi_2$ plane as shown in Fig. 5, where $\varphi_1$, $\varphi_2$ go only over their physically significant range of $2\pi$. Consider an initial point, $P$, with $\varphi_1 = \varphi_{01}$ and $\varphi_2 = \varphi_{02}$. Then with the passage of time, the phase point will move until it finally reaches the point, $Q$, where one of the variables, $\varphi_1$ or $\varphi_2$, is equal to $2\pi$ (in the diagram, it is $\varphi_1$ that is equal to $2\pi$). At this point, $\varphi_1$ must decrease abruptly by $2\pi$, and the phase point jumps to the point $Q'$. Then the point moves until it reaches $Q''$, then jumps to $Q'''$, moves to $Q^{IV}$, etc. The theorem of equidistribution then shows that in time the phase point will fill the above region equidensely. Evidently this can happen only if $\omega_1$ and $\omega_2$ are incommensurable, for if they are commensurable, the phase point will describe a closed curve.
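The contrast between the closed curve and the equidense filling can be made visible with a small numerical experiment. The sketch below is only an illustration under assumptions chosen here (a 20 x 20 grid of cells on the $\varphi_1$, $\varphi_2$ square, both initial phases set to zero, and the particular frequency pairs); it counts which cells the phase point visits.

```python
import math

def visited_fraction(omega1, omega2, T, steps, cells=20):
    """Fraction of a cells x cells grid on [0, 2 pi)^2 visited by the phase point."""
    two_pi = 2.0 * math.pi
    seen = set()
    for k in range(steps + 1):
        t = T * k / steps
        # Reducing each phase modulo 2 pi reproduces the abrupt jumps by 2 pi.
        i = min(int((omega1 * t) % two_pi / two_pi * cells), cells - 1)
        j = min(int((omega2 * t) % two_pi / two_pi * cells), cells - 1)
        seen.add((i, j))
    return len(seen) / (cells * cells)

# Commensurable pair (ratio 2): the phase point retraces a closed curve.
frac_comm = visited_fraction(1.0, 2.0, T=2000.0, steps=200_000)
# Incommensurable pair (ratio sqrt(2)): the phase point fills the square.
frac_incomm = visited_fraction(1.0, math.sqrt(2.0), T=2000.0, steps=200_000)
print(f"commensurable:   {frac_comm:.2f} of the cells visited")
print(f"incommensurable: {frac_incomm:.2f} of the cells visited")
```

Counting visited cells only tests that the region is filled densely; the stronger statement of equidensity is what the comparison of time and angle averages expresses.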

In the space of the $p_j$ and the $q_j$ we obtain the well-known Lissajous figures,
which fill the space densely, but not in general equi-densely.

(*) This result is essentially a proof of the quasi-ergodic theorem for the special problem that we are treating. (See section 6).

The generalization to any number of variables is quite straightforward, and we need only state the result that if all the $\omega_j$ are incommensurable, then the distribution will have a uniform probability density in the configuration space $(\varphi_1 \ldots \varphi_N)$. Eqs. (10.8) and (10.11) then represent a statement of the fact that the time average behavior of a single system can be represented by a suitable average over angles of an ensemble of systems, taken at a single instant of time, $t$. Thus, the use of statistical ensembles (which are essentially the same as the collectives of von Mises) to represent the statistical properties of a single system is justified. The uniformity of the distribution in the configuration space evidently implies that $P(\varphi_1, \varphi_2 \ldots \varphi_N) = P(\varphi_1)P(\varphi_2) \ldots P(\varphi_N)$, so that the phase angles of the different oscillators are uncorrelated, and therefore statistically independent. Physicists have gradually come to refer to a system of variables that behaves in this way as «random». (Thus, we obtain a justification of the definition of randomness suggested in Sec. 9, at least as far as applications in physics are concerned).
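The equivalence between the time average of a single system and an average over an ensemble taken at one instant can also be illustrated numerically. In the following Python sketch the observable `f`, the frequencies, the instant `t`, the ensemble size and the random seed are all assumptions introduced here for the purpose of illustration; the ensemble members are given independent, uniformly distributed initial phases.

```python
import math
import random

def f(phi1, phi2):
    # Hypothetical observable depending on the two phase angles; its angle average is 1.
    return 1.0 + math.cos(phi1 + phi2) + 0.5 * math.sin(phi1)

def time_average(omega1, omega2, phi01, phi02, T, steps):
    """Time average of f along the trajectory of one single system."""
    dt = T / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        total += f(phi01 + omega1 * t, phi02 + omega2 * t)
    return total / steps

def ensemble_average(omega1, omega2, t, members, rng):
    """Average of f at one instant t over systems with uniformly random initial phases."""
    two_pi = 2.0 * math.pi
    total = 0.0
    for _ in range(members):
        phi01 = rng.uniform(0.0, two_pi)
        phi02 = rng.uniform(0.0, two_pi)
        total += f(phi01 + omega1 * t, phi02 + omega2 * t)
    return total / members

rng = random.Random(0)  # fixed seed so that the sketch is reproducible
single = time_average(1.0, math.sqrt(2.0), 0.4, 2.1, T=2000.0, steps=200_000)
ensemble = ensemble_average(1.0, math.sqrt(2.0), t=3.0, members=40_000, rng=rng)
print(f"single system, time average:    {single:.3f}")
print(f"ensemble, single-instant average: {ensemble:.3f}")
```

The ensemble estimate carries a sampling error of the order of the inverse square root of the number of members, so agreement is to be expected only within that error.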

As we have already pointed out, the problem of random phases has a wide application in statistical mechanics. This problem is particularly important in quantum theory, because it enters as an essential part of the foundations of quantum statistical mechanics. To see how this comes about, we note that, as is well known, the quantum state of a system is defined by its wave function $\psi(x_1 \ldots x_N)$, where $x_1 \ldots x_N$ are the coordinates of the system. An arbitrary wave function can be expressed in terms of the series of eigenfunctions $\psi_n(x_1 \ldots x_N)$ of the Hamiltonian operator, corresponding to eigenvalues, $E_n$. Thus,

(10.12) $\psi = \sum_n C_n\,\psi_n(x_1 \ldots x_N)\exp[-iE_n t/\hbar]$.


(We are assuming the system to be in a bounded region of space, so that the 
eigen-values are discrete, and we are assuming that there is no degeneracy. 
The case of degeneracy requires a more extensive treatment, which we shall 
not give here.) 

We then write

(10.13) $C_n = a_n \exp[i\varphi_{0n}]$,

obtaining

(10.14) $\psi = \sum_n a_n\,\psi_n \exp[i\varphi_n]$,

with

(10.15) $\varphi_n = \varphi_{0n} - E_n t/\hbar$.


Comparing (10.14) and (10.15) with (10.6) and (10.7) we see that, formally speaking, the wave function acts like an infinite collection of harmonic oscillators, one for each eigenfunction of the hamiltonian. This is not surprising, since the Schrödinger equation is linear in $\psi$ and, as is well known, a wave field satisfying a linear partial differential equation can be expressed in a series of terms each oscillating with its own characteristic frequency.

Now we note that in general, the angular frequencies $\omega_n = E_n/\hbar$ of oscillation of the wave function in different stationary states will not be commensurable (*). In the long run, we shall therefore have a uniform distribution in the configuration space of the phases, and the phases will be random relative to each other.

Finally, we remark that the problem of random phases is formally of the
same type as the one treated by Poincaré in connection with the distribution
of the phase angles of a swarm of asteroids in neighbouring orbits (see Section 8).


11. — Conclusion. 


In this paper, we have shown that in a large number of statistical problems 
arising in physics, it is either advantageous or necessary to go outside the 
framework of concepts that appear in the theory of probability. Among such 
problems are the following: 


(1) The calculation of the large-scale statistical and thermodynamic properties
of an aggregate containing a large number of molecules.
(2) The treatment of non-equilibrium states of such an aggregate.


(3) The demonstration that a given kind of sequence contains statistical 
relationships that permit its long-run behavior to be treated by 
means of the calculus of probabilities. 


(4) The estimate of error in the theory of probability. 
(5) The treatment of approximate and partial randomness. 


(6) The treatment of chance fluctuations originating in contingencies 
that are not totally independent of what happens in the context 
under discussion. 


(*) They are commensurable for the one-dimensional harmonic oscillator, but not
in most other general problems.

An inspection of the above list would suggest that the problems of statistical physics in which the theory of probability does not apply are perhaps even more important than those in which it does apply. A systematic attack on statistical problems going outside the scope of the theory of probability, coupled with an effort to develop new formal systems that are more general than that of the calculus of probabilities, would therefore be likely to produce fruitful results.

1. — Introduction.


The difficulties in connexion with the construction of large proportional 
scintillation counters are well known. It can be plainly said, that it is no 
longer useful to speak of proportionality, when the three counter dimensions 
become excessively great. If only two, or one, of the three dimensions are 
very great, one can then still speak of proportionality, provided that one uses 
a sufficiently high number of photomultipliers. 

We wished to make a detailed study of the causes of disuniformity in counters 
measuring some decimetres, for which the problem of the realization of pro- 
portional counters can still reasonably be put. 

When it is necessary to construct counters of this type, we find ourselves faced with a very wide possibility of choice. In fact, a choice can be made of transparent, reflecting or diffusing walls; a greater or smaller number of P.M.'s can be used, which can be arranged to view the scintillator in the most varied manners; the scintillator can be chosen from a large number of substances which are suitable for this purpose and, finally, if the experiment does not expressly require a special shape, even the form of the counter can be arbitrary. It is somewhat difficult to establish a priori what can be, from the point of view of economy, the most convenient combination of all these elements.

In this work we wished to bring out the criteria which can
point the way to a solution of each particular problem.

We conducted our analysis beginning with a particular examination of a 
transparent-walled parallelepiped-shaped counter, establishing in this case some 
criteria which permit, from the point of view of designing, an estimation of 
the uniformity which can be obtained with this arrangement. The extension 
of the analysis to other shapes and wall types, which was carried out by direct 
comparison with the first one studied, allowed us to arrive at some conclusions 
which may be of use when a counter of the above-mentioned dimensions is 
planned. 

The study of the temporal behaviour of light collection was also carried
out by the same criterion. That permitted us to establish the lower limit
of the resolving time of these counters which does not influence the uniformity
of the response.


2. — Generality. 


We shall say that a scintillation counter has a uniform response when the
electric charge collected at the P.M. anode is proportional to the light emitted
in scintillation and independent of the position at which the emission took
place. Such a counter, for the sake of brevity, will be called uniform. This
condition is necessary, but not sufficient, for it to be proportional (*).

The realization of a uniform counter shows itself to be more complex as 
the dimensions of the said counter are greater. 

The following factors influence the uniformity: 


1) the type of walls (diffusing, transparent, reflecting), the shape of the 
counter and the position in which the P.M.’s are placed; 


2) the absorption of the walls and the scintillator; 
3) the disuniformity of the photosensitive surface (*). 


(*) Proportionality implies considerations on the energy loss in the counter and,
therefore, on the type of particle to be counted and on the nature of the phosphor.
(+) This expression is intended to indicate:
a) in the case of a single P.M. the different sensitivity from point to point of
the photocathode;
b) in the case of more than one P.M. the different gain also.

The first two factors have an influence on uniformity in light collection, which is realized when the ratio between the light which falls on the photosensitive surface and that which is emitted is independent of the position at which scintillation takes place.

To understand the role played by the third factor it is necessary to bear
in mind that light is not distributed uniformly on a photosensitive surface.
In fact, the total quantity of incident light being equal, the distribution
depends on the position at which scintillation takes place.


3. — On the Uniformity of a Counter with Transparent Walls. 


3.1. Geometrical Considerations. — In a transparent-walled counter one part of the light emitted in scintillation remains entrapped and another escapes. In order to get uniform collection, and leaving out for the moment the question of absorption, it is necessary (but not sufficient) for the percentage of entrapped light to be independent of the position at which scintillation takes place. Now, spherically (1) and cylindrically (2) shaped counters do not satisfy such a condition. The one geometrical shape which realizes uniform entrapping is that of a parallelepiped (1,3). In fact, supposing the material constituting the parallelepiped to be homogeneous, and denoting by $\gamma$ the limit angle of this medium in respect to the air, it can be seen that there exist six cones of aperture $2\gamma$ through which light escapes (*). It is easy to estimate the percentage of entrapped light (1) once the refractive index $n$ has been given.

Simple geometrical considerations show clearly that it is necessary, if all 
the entrapped light is to be collected by the photosensitive surface, for the 
latter to cover at least one entire face of the parallelepiped. 

(1) W. A. Shurcliff and R. C. Jones: Journ. Opt. Soc. Am., 39, 912 (1949).

(2) D. Brini, L. Peli, O. Rimondi and P. Veronesi: Nuovo Cimento, 11, 655 (1954).

(3) R. H. Gillette: Rev. Scient. Instr., 21, 294 (1950).

(*) Rigorously (4), a small percentage of this light is reflected and partially polarized. On an average these rays undergo other reflections in order to reach the photosensitive surface, through which, even setting aside scintillator absorption, they are in practice rapidly extinguished.

(4) F. A. Jenkins and H. E. White: Fundamentals of Optics (London, 1953), Chapter 28.

Every other arrangement of the photosensitive surface which does not satisfy such a condition always causes the loss of one part of the entrapped light, a part which depends on the position of scintillation. It is necessary to remark, however, that, with an entire face which is photosensitive, not only the entrapped, but also the direct, light is collected. In order for this contribution to be independent of the scintillation position, it is necessary that scintillation should never take place at a distance lower than $l = d \cdot \mathrm{tg}\,\gamma$ (see Fig. 1) from the photosensitive face, where $d$ is the diagonal thereof. In other words, the photosensitive face must be placed at a distance $l$ from the scintillator. Naturally this space must be filled with material having the same refractive index as the scintillator. Such an optical connexion is normally called pipe light. For successive calculations, the pipe light will be supposed to be a prolongation of the parallelepiped (Fig. 1), which is not generally the case in practice.


From now on, then, we shall make a distinction between the collection
surface and the photosensitive one, which are the two opposite surfaces of
the pipe light, distant $l$ from one another, and parallel.

In a counter so made, and continuing to leave out the question of absorption,
and under the condition $\gamma < 45°$ ($n > \sqrt{2}$), the percentage of light collected
will appear as

$R = \tfrac{1}{2}(5\cos\gamma - 3)\cdot 100\%$.

For the refractive index $n = 1.5$, such a percentage, which will hereafter be
called the collection yield, appears as $R \approx 36\%$. With two opposite photosensitive
faces a yield of $R \approx 49\%$ is obtained.

The same yield could be obtained with one photosensitive face only making 
the opposite face reflecting, provided that the reflection coefficient be one. 
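The figures quoted above can be reproduced with a few lines of Python. The sketch below is only an illustration: the one-face formula is the one given in the text, while the expressions for the entrapped fraction and for the two-face yield are assumptions made here (each of the six escape cones is taken to subtend the fraction $(1 - \cos\gamma)/2$ of the full solid angle, and a collecting face is taken to recover the light of the cone directed towards it). Absorption is left out, as in the text.

```python
import math

def cos_limit_angle(n):
    """cos(gamma) for the limit angle gamma, defined by sin(gamma) = 1/n."""
    if n <= math.sqrt(2.0):
        raise ValueError("the formulas assume gamma < 45 degrees, i.e. n > sqrt(2)")
    return math.sqrt(1.0 - 1.0 / (n * n))

def entrapped_fraction(n):
    # Assumption: six non-overlapping escape cones, each taking (1 - cos gamma)/2.
    return 1.0 - 3.0 * (1.0 - cos_limit_angle(n))

def yield_one_face(n):
    # Formula quoted in the text: R = (1/2)(5 cos gamma - 3).
    return 0.5 * (5.0 * cos_limit_angle(n) - 3.0)

def yield_two_faces(n):
    # Assumption: entrapped light plus the two cones directed towards the two faces.
    return entrapped_fraction(n) + (1.0 - cos_limit_angle(n))

n = 1.5
R1 = yield_one_face(n)
R2 = yield_two_faces(n)
print(f"entrapped fraction:  {100.0 * entrapped_fraction(n):.1f} %")
print(f"one collecting face: {100.0 * R1:.1f} %")
print(f"two opposite faces:  {100.0 * R2:.1f} %")
```

Under these assumptions the one-face yield is the entrapped fraction plus one escape cone, which is the same expression as the formula of the text, and the two-face value agrees with the 49% quoted.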

It is then evident that with an increase in the number of photosensitive 
faces the yield only increases (the question of absorption is as always set aside). 
Obviously there is very little practical advantage to be gained from considering 
a counter with more than two entire collection faces. 

From what has been said above it is apparent that in order to have a
transparent-walled counter with uniform light collection and good yield, it is
necessary that the following essential conditions be satisfied:

a) parallelepiped shape;
b) at least one entire collection face;
c) pipe light with a minimum length of $l = d \cdot \mathrm{tg}\,\gamma$.
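Condition c) fixes the minimum length of the pipe light once the collection face and the refractive index are known. As a minimal sketch, assuming a rectangular collection face of sides `b` and `c` (the 10 cm x 10 cm face used below is a hypothetical example, not a dimension taken from the text):

```python
import math

def min_pipe_length(b, c, n):
    """Minimum pipe-light length l = d * tg(gamma) for a b x c collection face."""
    d = math.hypot(b, c)                       # diagonal of the collection face
    tan_gamma = 1.0 / math.sqrt(n * n - 1.0)   # follows from sin(gamma) = 1/n
    return d * tan_gamma

l_min = min_pipe_length(10.0, 10.0, 1.5)  # hypothetical face, lengths in cm
print(f"minimum pipe-light length: {l_min:.2f} cm")
```

For $n = 1.5$ the required length is close to 0.9 times the diagonal of the face, which shows how quickly the pipe light grows with the size of the collection surface.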


In practice, except in the case of a phosphor-plastic mixture, the scintillator 
is always placed inside a container, the walls of which will be either diffusing, 
reflecting or transparent. We shall examine the last case first making the 
hypothesis (which in practice can be done with a good approximation) that 
the refractive index of the material which constitutes the container is identical 
to that of the phosphor. Under this condition it is the container which must 
satisfy the required geometrical conditions. That means that the parallele- 
piped is limited by the external faces of the container, while the scintillator 
inside it can have any form whatsoever. Nevertheless, if this condition is not
required by particular experimental exigencies, it is convenient for the scintillator
also to have a parallelepiped shape. This implies that the container
walls be made of plane parallel surfaces.

For the discussion of the causes of disuniformity 2) and 3) we shall con- 
tinue to refer to a counter which satisfies the preceding conditions. 


3.2. Absorption influence. — When the dimensions of the counter (and, therefore, the ranges of light) are considerable, light absorption on the part of the walls and the scintillator plays a very important role in regard to uniformity. Such absorption, in addition to having an influence on uniformity, obviously influences the yield also (together with the absorption due to the pipe light). In designing a big counter it is, therefore, of great importance to choose the most suitable materials to be used, with the aim of reducing to a minimum the disuniformity due to absorption, compatibly with the dimensions of the counter itself. It is obvious that, in order to make exact calculations in this connexion, it is necessary to bear in mind:


a) the scintillator emission spectrum; 
b) the scintillator absorption spectrum; 


c) the absorption spectrum of the material of which the walls and pipe 
light are made. 


Unfortunately in practice, exact information on these data is not always available. As regards the emission spectrum, it is known that the relevant measurements are neither simple nor very exact (5-7). The absorption spectra can, on the contrary, be sufficiently well measured, apart from the uncertainty due to the purity of the phosphor.

(5) H. Kallmann and M. Furst: Phys. Rev., 81, 853 (1951).
(6) F. B. Harrison: Nucleonics, 10, n. 6, 40 (1952).
(7) R. C. Sangster: M.I.T. Technical Report 55 (January 1, 1952).

To estimate the absorption by the walls, scintillator and pipe light of the light emitted in scintillation, let us suppose, as we mentioned before, the refractive indices of the various materials (scintillator, walls and pipe light) to be equal.

Let us again suppose that scintillation is in a point region. Let us consider Fig. 1 and indicate with:


$a$, $b$, $c$, the edges of the parallelepiped which has, as a collection surface, a surface $cb$;

$y$, the distance of scintillation $P$ from the collection surface (this is the only co-ordinate of $P$ which is of interest for these considerations in this particular geometrical form of the counter);

$r$, the generic ray escaping from $P$;

$x_1$, a light range in a wall $ac$ (Fig. 2);

$s_1$, the thickness of the horizontal walls;

$x_2$, a light range in a wall $ab$ (Fig. 2);

$s_2$, the thickness of the vertical walls, parallel to $y$;

$x_3$, the light range in the pipe light;

$l = d \cdot \mathrm{tg}\,\gamma$, the length of the pipe light;

$x_s$, the light range in the scintillator;

$\alpha_s(\lambda)$, the scintillator absorption coefficient;

$\alpha_w(\lambda)$, the wall absorption coefficient;

$\alpha_p(\lambda)$, the pipe light absorption coefficient;

$N_1$, the number of reflections on the horizontal walls;

$N_2$, the number of reflections on the vertical walls, parallel to $y$.


Let us suppose that, in a scintillation, a number of photons $\mathcal{N}_0(\lambda)$ of a wavelength between $\lambda$ and $\lambda + d\lambda$ are emitted isotropically. The number of photons emitted in the elementary solid angle $d\omega = \cos\varphi\,d\varphi\,d\theta$ which arrives on the photosensitive face is given by

(1) $d\mathcal{N}(\lambda) = \frac{\mathcal{N}_0(\lambda)}{4\pi}\exp[-\alpha_w(\lambda)(N_1 x_1 + N_2 x_2) - \alpha_p(\lambda)\,x_3 - \alpha_s(\lambda)\,x_s]\,d\omega\,d\lambda$.

69 — Supplemento al Nuovo Cimento.


An approximate calculation of $N_1$ and $N_2$ can be made by considering the
projections of the generic ray $r$ escaping from scintillation, one on a vertical
plane and the other on a horizontal plane (Fig. 1). We then obtain


t y 
M=ttgo=3 DET M=ttg0. 


$N_1$ and $N_2$ will evidently have to be integers. Assuming them as given
by the two preceding expressions, we make, in effect, an overestimation of the
absorption due to the walls.

From Fig. 2 it can immediately be deduced, respectively, that: 


2s, 28> 285 


Wi a= t= = — - 
- cos 8 cosq sino 


[SC] 


ro 


Analogously it can be found that: 


$x_3 = \dfrac{l}{\cos\varphi\cos\theta}$, $x_s = \dfrac{y}{\cos\varphi\cos\theta}$.
Then (1) becomes

(2) $d\mathcal{N}(\lambda) = \frac{\mathcal{N}_0(\lambda)}{4\pi}\exp[-A(y, \lambda)/(\cos\varphi\cos\theta)]\cos\varphi\,d\varphi\,d\theta\,d\lambda$,


where it is 
Sy 


Aly, 2) = y [eu(A) + 2a) — (è 


+ al “0 


We shall carry out the integration of (2) in respect only to $\theta$ and $\varphi$, considering an interval $\Delta\lambda$ of $\lambda$ within which, practically speaking, the quantities dependent on it can be considered constant. The relative quantity of light, given by $\Delta R(\lambda) = \Delta\mathcal{N}(\lambda)/\mathcal{N}_0(\lambda)$, received by the photosensitive surface can be considered split in two contributions $\Delta R_1$ and $\Delta R_2$:

1) $\Delta R_1$ represents the quantity of the light emitted by the source into the half-space turned towards the photosensitive face in respect to a plane parallel to this;

2) $\Delta R_2$ is the analogous contribution due to the light emitted into the opposite side of the plane considered, which, through successive reflections, can reach the photosensitive face itself.


Strictly speaking it would be necessary to bear in mind, when calculating this contribution, the absorption due to the one range in the wall opposite to that of collection. That is expressed in a prolongation of the length $l$ of the pipe light by a quantity (in the case of $\alpha_w = \alpha_p$) equal to $2s$.

Our calculations were carried out neglecting $2s$ in respect to $l$, a condition which, in practice, it is opportune to realize.

The detailed calculations will be reported in Appendix I. These calculations allow for the construction of the two curves $\Delta R_1$ and $\Delta R_2$ (Fig. 3) which give us, respectively, the yields due to the two contributions 1) and 2) as functions of $A$. Simple geometrical considerations show that, for a given value of $y$, contribution 1) corresponds to a certain value $A(y, \lambda)$ and contribution 2) to a value of $A(2a - y, \lambda)$. For a generic scintillation taking place at a distance $y$, these curves allow the estimating of the relative contributions $\Delta R_1$ and $\Delta R_2$, which refer to the various intervals $\Delta\lambda$ which satisfy the conditions laid down at the beginning, in which the emission spectrum can be considered subdivided. The total contribution $\Delta R$, for an interval, is expressed by:

$\Delta R = \Delta R_1 + \Delta R_2$,

while the total contribution due to all the spectrum intervals considered will evidently be given by

$\sum \Delta R$.

In the case in which the photosensitive faces are two entire opposite ones, the same considerations unfolded above are valid, except that it is necessary to take into consideration only the curve $\Delta R_1$ in correspondence to the two values $A(y, \lambda)$ and $A(a - y, \lambda)$. The relative contribution of light collected then,


1056 D. BRINI, L. PELI, 0. RIMONDI and P. VERONESI 


corresponding to the interval $\Delta\lambda$, will be:

$\Delta R = \Delta R_1' + \Delta R_1''$,

where $\Delta R_1'$ is calculated for $A(y, \lambda)$ and $\Delta R_1''$ is calculated for $A(a - y, \lambda)$.

This arrangement permits an improvement both in the yield and in the uniformity of collection.

We might think of reaching the same results by substituting a photosensitive surface by a reflecting one. In this case, maintaining the usual symbolism, we should obtain:

$\Delta R = \Delta R_1 + k\,\Delta R_2$,

where $\Delta R_1$ has its usual meaning, $\Delta R_2$ corresponds to $A(2a - y, \lambda)$ and $k$ is a factor < 1 which represents the reflection coefficient. In this case the yield and collection uniformity are midway between the two cases treated previously. The results obtained relative to a counter studied experimentally will be referred to hereinafter; these will help to clarify the conclusions drawn above.


3.3. Influence of the P.M. Photocathode Sensitivity. - The preceding remarks refer to the quantity of light collected by the photosensitive surface independently of the spectral sensitivity characteristic of the P.M. It is evident, however, that, given a certain emission spectrum, the collected light spectrum will be different with the varying of $y$ because of the selective absorption of the scintillator, walls and pipe light. Since the P.M.'s too present selective sensitivity for the various wavelengths, this is expressed by the fact that, the number of photons being equal, there is a different charge response. Using the P.M. sensitivity curves and the results of § 3.2, which permit us to express the relative quantity of light collected in the spectral interval between $\lambda$ and $\lambda + d\lambda$, the charge accumulated at the anode by the effect of this contribution will be:


$\Delta Q(\lambda) = g \cdot \Delta R \cdot \mathcal{N}_0(\lambda)$ (*).


(*) Assuming Birks' (8) notations g corresponds to the product

Opzf(vp) (CR)^m

where the first factor is the photoelectric conversion efficiency of the cathode and the second is the overall gain of the tube, which is expressed by C (collection efficiency factor of the photoelectrons by the dynode system), R (electron multiplication factor) and m (number of the dynodes). Opzf(vp) is a quantity which is variable from point to point on the photocathode surface (9), wherefore, in a very strict estimate, it would be necessary to bear this in mind also. We shall see later on how the calculation can be made.
(8) J. B. Birks: Scintillation Counters (London, 1953), Chapter 2.


(9) F. H. Marshall, J. W. Coltman and A. I. Bennett: Rev. Scient. Instr., 19, 744 (1948).

3.4. Influence of Light Distribution on the Photosensitive Surface. — Another cause of disuniformity is to be found in the different values of the ratio $Q/\mathcal{N} = g$ (total charge at the anode/number of photons fallen on the photocathode) for the different P.M.'s which look at the counter, $\mathcal{N}$ being equal. The shape of the counter being known, together with its dimensions, type of walls, etc. (characteristics which have already been chosen for the best uniformity in light collection), it is now a problem of seeing in which way the use of any number of P.M.'s influences the uniformity of the electric signal with the varying of the position at which scintillation takes place. Should the $n$ P.M.'s all have the same value of the ratio $Q/\mathcal{N} = g$, it is obvious that there would be no cause for disuniformity. If the parameter $g$ of each tube were different, but the quantity of light which falls on each one of them remained constant (even if differing from tube to tube) with the varying of the scintillation position, there would not be disuniformity even in this case. Disuniformity arises, on the contrary, because the values of $g$ are different among themselves, and the quantity of light received separately by the various tubes varies with the varying of the scintillation position.

These considerations can be made quantitative. Let $\mathcal{N}$ be the total quantity of light received by the photosensitive face and $\mathcal{N}_i$ that received by the $i$-th tube. Let us suppose as given the behaviour of all the $\mathcal{N}_i$ as functions of the scintillation position $P$ (Fig. 4). (The figure reports the case of $n = 3$). Whatever their value may be, a point-by-point deviation $\Delta\mathcal{N}_i$ from the mean value $\mathcal{N}_m = \mathcal{N}/3$ can be determined. If the counter is optically uniform, it is $\mathcal{N}$ = const, i.e. the light received by the photosensitive face is the same whatever $P$ may be. Bearing in mind the sign of $\Delta\mathcal{N}_i$, it follows that $\sum_i \Delta\mathcal{N}_i = 0$. The charge contribution for each tube is

$Q_i = g_i\,\mathcal{N}_i$,

where $Q_i$ indicates the value of the charge obtained at the output of the $i$-th P.M. The last relation can also be written


$Q_i = g_i\,\mathcal{N}_m + g_i\,\Delta\mathcal{N}_i$.

Summing up the $n$ tube contributions, we get

$Q_{tot} = g_1\mathcal{N}_m + g_1\Delta\mathcal{N}_1 + g_2\mathcal{N}_m + g_2\Delta\mathcal{N}_2 + \ldots + g_n\mathcal{N}_m + g_n\Delta\mathcal{N}_n = \sum_i g_i\,\mathcal{N}_m + \sum_i g_i\,\Delta\mathcal{N}_i$.

The term $\sum_i g_i\,\mathcal{N}_m$ is constant and does not influence the disuniformity, wherefore the contribution to the latter is due only to

(3) $\sum_i g_i\,\Delta\mathcal{N}_i$,

in which the different $\Delta\mathcal{N}_i$, we repeat, have well-defined signs such that

$\sum_i \Delta\mathcal{N}_i = 0$.

The expression (3) becomes zero if

a) $g_1 = g_2 = \ldots = g_n$, since, as it has been said, $\sum_i \Delta\mathcal{N}_i = 0$.

b) Every $\mathcal{N}_i$ = const for any point at which scintillation takes place.


In general these two conditions are never satisfied; the first because of the unavoidable difference in the value of $g$ between one P.M. and another. (It is well to remember that in every case the values of $g_i$ can be adjusted by a suitable variation of the voltage on one of the intermediate multiplication stages). Even in the case of a single P.M. the uncertainty about the value of $g$ remains, owing to the different sensitivity point by point of the photocathode. The second condition is not satisfied because light is distributed differently on the photosensitive face with the varying of the scintillation position, whatever may be the counter form and its dimensions.

To estimate (3), we may take as known the mean value $g_m$ of the $g_i$, with a $p$% error, and we may imagine as known every single deviation $\Delta\mathcal{N}_i$, in value and sign, for whatever point of the counter, in respect to the mean value $\mathcal{N}_m$ of the luminous flux received by each tube.

The expression (3) then becomes

$g_m \sum_i \Delta\mathcal{N}_i \pm \sum_i 10^{-2}\,p\,g_m\,|\Delta\mathcal{N}_i|$.


If the counter is at uniform collection, with $\sum_i \Delta\mathcal{N}_i = 0$, the contribution
to disuniformity is reduced to

(4) $\sum_i 10^{-2}\,p\,g_m\,|\Delta\mathcal{N}_i|$,


' greatly altered in respect to the pipe light 
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where the value of 497; depends on the point at which scintillation takes 
place. The meaning of (4) is clear; it represents the maximum possible, 
positive or negative, separation from its mean value da Imm Of the re- 
sponse. 

The maximum percentage value of the disuniformity obtained is then

(5) (*) $p\,\dfrac{\sum_i |\Delta\mathcal{N}_i|}{n\,\mathcal{N}_m} .$


It can be seen that, case by case, the point-by-point value of $\Delta\mathcal{N}_i$ must be known in order to estimate what uniformity can be expected. In other words, we have to find the light distribution function on the photosensitive walls.
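
As a rough numerical illustration of the estimate above, the following sketch evaluates the worst-case percentage disuniformity for a set of per-tube light fluxes. It assumes that the percentage value is expression (4) divided by the mean response (number of tubes × mean gain × mean flux per tube); the function name and the sample fluxes are hypothetical and are not taken from the paper.

```python
def max_percent_disuniformity(fluxes, p):
    """Worst-case percentage spread of the summed response.

    fluxes : light received by each tube for one scintillation position
             (arbitrary units); illustrative values, not measurements.
    p      : percentage uncertainty on the tube gains.
    Assumption: expression (4) divided by the mean response.
    """
    n = len(fluxes)
    mean_flux = sum(fluxes) / n
    # Sum of absolute deviations from the mean flux; the signed
    # deviations cancel by construction.
    abs_dev = sum(abs(f - mean_flux) for f in fluxes)
    return p * abs_dev / (n * mean_flux)


if __name__ == "__main__":
    # Three tubes, the central one receiving more light (made-up numbers).
    print(max_percent_disuniformity([90.0, 120.0, 90.0], p=10))
```

With perfectly even illumination of the tubes the function returns zero whatever the gain spread, which is condition b) of the text.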

The curves of Fig. 4 refer to a counter of the type discussed previously, represented schematically in Fig. 5. The light pipe of such a counter, although of length $l = d\,\operatorname{tg}\gamma$, has two parallel, plane-surfaced cuts of length $h < l$. With this arrangement the uniformity of light collection is still realized, but the light is distributed on the photosensitive surface as if that surface lay on the upper plane which bounds the cuts, and the distribution is therefore greatly altered with respect to that of a light pipe without cuts. The cuts are necessary in practice to connect the collection surface to that of the P.M. We refer to a counter of this type because it corresponds to the one we actually used, which will be discussed later on. Fig. 4 gives as ordinate, in arbitrary units, the quantity of light received by each tube and as abscissa the distance $Y - h$, where $Y = y + l$. The curves were calculated (see Appendix II) for the points of the straight line which corresponds to the axis of a P.M.

From (5) and the curves of Fig. 4, taking p = 10 and Y − h = 4 cm, a maximum disuniformity of ~ 8% is to be expected as a consequence of the light distribution on the photosensitive surface. The family of curves in Fig. 6, with p as parameter, gives the maximum disuniformity as a function of $(l - h)/(c/3)$, where $l$ is the value of $Y$ for which the disuniformity is greatest, and the difference $l - h$ has been normalized to $c/3$. The arrow indicates the abscissa at which the maximum disuniformity of the counter we constructed is obtained.

(*) This expression always overestimates the disuniformity due to this cause, since the contributions of all the P.M.'s to the disuniformity are supposed to add. In practice this condition can be avoided by a convenient choice of P.M.'s, so that the values given by (5) may be taken as larger than the real ones.

Fig. 6. (ordinate: disuniformity, %).

If the photosensitive surface is the photocathode of a single P.M., this estimation can still be important (10). In this case it is sufficient to imagine the photocathode surface subdivided into a convenient number of equal small areas (areolae), the sensitivity being uniform within each. Repeating the preceding arguments, we find that the maximum relative disuniformity is still given by (5), where the sum is extended over the areolae into which the cathode surface is subdivided, and the value of p expresses the uncertainty in the knowledge of g for every areola considered (*). Estimations of the influence of p on the amplitude resolving power of a crystal spectrometer using a single P.M. 5819 have been made by BORKOWSKI and CLARK (11) and by GODLOVE and WADEY (12). The results of the latter authors indicate that the disuniformity of the photocathode surface is not decisive for the resolving power, which depends essentially on the average sensitivity of the photocathode. To state the matter more exactly, the resolving power depends fundamentally on the

(10) G. F. J. GARLICK and G. Y. WRIGHT: Proc. Phys. Soc., B 65, 415 (1952).
(*) Referring once more to the already-mentioned notation of BIRKS, there is a contribution to the uncertainty of g on every areola from the product


Opal (vp)e" 


where c’ is the collection efficiency factor of the photoelectrons from the cathode at 
the first dynode of the multiplication system. 

(11) C. J. BORKOWSKI and R. L. CLARK: Rev. Scient. Instr., 24, 1046 (1953).

(12) T. F. GODLOVE and W. G. WADEY: Rev. Scient. Instr., 25, 1 (1954).

statistical fluctuations of the photoelectrons, which are smaller the greater the mean sensitivity.

For a counter which is not optically uniform the expression of the maximum 
relative disuniformity becomes 


where the first term is due to disuniformity in collection (depending on the shape of the counter and on absorption), and the second to disuniformity in light distribution.


4. — Uniformity of a Counter with Reflecting Walls. 


A counter with reflecting walls and a suitably arranged collection surface (arranged so as to receive sooner or later, after successive reflections, all the light emitted in the scintillation) would have uniform collection were it not for absorption in the scintillator and for the reflection coefficient of the walls being less than one. In fact (*), the reflection coefficient is always less than one and, moreover, is selective for the different wavelengths. The rays which leave the scintillator at a sufficiently large angle $\varphi$ reach the collection surface after many reflections, and undergo an attenuation through reflection and absorption which grows with the distance of the scintillation from the collection surface. It is evident then that such a counter must be very disuniform, especially because of the rays escaping at large $\varphi$: there will be a great difference between the light collected when the scintillation takes place near the collection surface and when it takes place far from it.

A substantial improvement can be obtained by eliminating the strongly inclined rays, since they are the main cause of disuniformity. This can be achieved by adopting the parallelepiped shape with at least one entire face used for collection, this face being joined to the photosensitive one by a light pipe with transparent lateral surfaces, of length not less than $d\,\operatorname{tg}\gamma$, similar to that examined in the previous case for transparent-walled counters. The effect of the light pipe is to reduce the light incident on the photosensitive surface to that corresponding to the case of the transparent-walled counter.

For reasons which have been explained previously, the use of such a light pipe excludes the consideration of any shape other than the parallelepiped. To compare two parallelepiped-shaped counters, one with reflecting walls and the other with transparent ones, having equal dimensions and employing the same scintillator, it is sufficient to consider the exponent A of (2).
In the second case let us recall that

(6) $A_T = y\{\alpha(\lambda) + C_T\} + l\,\alpha_p(\lambda) ,$


where it is 


In the first case it is easy to show that

(7) $A_R = y\{\alpha(\lambda) + C_R\} + l\,\alpha_p(\lambda) ,$
where it is 
C, = (=! = eee (A, 9; 0) 


and $\alpha_r$ represents the absorption coefficient due to reflection, related to the reflection coefficient k by

$k(\lambda, \varphi, \vartheta) = \exp[-\alpha_r(\lambda, \varphi, \vartheta)] .$

Since k < 1 in every case, $\alpha_r$ is always positive, and hence we shall always have $C_R > 0$.

From a comparison of (6) and (7), it can be seen that two different situations can arise:

a) $\alpha(\lambda) > \alpha_p(\lambda)$. This corresponds to $C_T < 0$. It then always results that $A_T < A_R$;

b) $\alpha(\lambda) < \alpha_p(\lambda)$. It then results that $C_T > 0$. In this case a comparison between the two types of counters is not immediate.


From this examination it can be affirmed that the counter with transparent walls is, in situation a), always the most convenient of those examined so far.

This was the case, for example, for the counter which we constructed, in which the walls were of plexiglass and the scintillator was phenylcyclohexane + terphenyl + diphenylhexatriene.

The use of glass walls (which is necessary when the scintillator attacks plexiglass) leads in many cases to situation b), for which reason the use of reflecting walls cannot here be ruled out.

We considered that a rigorous calculation of the uniformity of counters with reflecting walls would not be necessary. We think in fact that, for counters constructed with the best reflecting surfaces, the calculations carried out for transparent walls remain a sufficient approximation.


5. — Uniformity of a Counter with Diffusing Walls. 


In order that a counter with diffusing walls have uniform collection it is necessary (leaving aside for the moment the question of scintillator absorption) that it have a spherical internal shape and a collection surface small compared with the spherical one, so as to realize the conditions of Ulbricht's sphere. Strictly speaking, such conditions can never be realized. In fact, in such a counter

a) the collection surface is never negligible compared with the spherical one (unless a good collection yield is renounced a priori);

b) the collection surface always collects a certain percentage of light which comes directly from the scintillation. When the scintillation takes place very close to the collection surface, the direct light amounts practically to half of that emitted. At greater distances, on the contrary, the direct light becomes negligible.

If, moreover, absorption is taken into account, a further very serious cause of disuniformity is introduced. It is easy to see that this absorption can be taken into account by suitably reducing the diffusion factor $\mu$ of the spherical walls to a value $\mu' < \mu$ (*).

An evaluation of the disuniformity of this counter can be made if we consider the results for Ulbricht's ideal sphere as still valid for the light which is not directly collected by the collection surface. Let us indicate with (Fig. 7):

— R the radius of the sphere;
— r the radius of the collection surface (supposed circular);
— y the distance of the scintillation P from the collection surface.

Fig. 7.

(*) The new diffusion coefficient can be considered as expressed by
$\mu' = \mu \exp[-\alpha_s R_m]$
where $\mu$ is the wall diffusion coefficient;
$\alpha_s$ is the scintillator absorption coefficient;
$R_m = (4/3)R$ is the mean path of the light between two diffusions within the sphere


of radius $R$.

The percentage of direct light (overestimated, since we suppose that all the rays originating from P travel the same distance to reach the collection surface) will be

$R_d = \dfrac{2\pi(1 - \cos\alpha)}{4\pi}\,\exp[-\alpha_s y]\,10^2 = \dfrac{1}{2}\left(1 - \dfrac{y}{\sqrt{y^2 + r^2}}\right)\exp[-\alpha_s y]\,10^2 ,$

where, given the indicative character of this estimation, $\alpha_s$ represents a convenient mean value of the scintillator absorption coefficient over the different wavelengths of the emission spectrum. The percentage of light which will reach the collection surface by diffusion will be

$R_i = \dfrac{4\pi - 2\pi(1 - \cos\alpha)}{4\pi}\,\dfrac{\mu'\,(\pi r^2/4\pi R^2)}{(1 - \mu') + \mu'\,(\pi r^2/4\pi R^2)}\,10^2 = \dfrac{1}{2}\left(1 + \dfrac{y}{\sqrt{y^2 + r^2}}\right)\dfrac{\mu'\,(r^2/4R^2)}{(1 - \mu') + \mu'\,(r^2/4R^2)}\,10^2 .$

Neglecting $r^2$ in respect to $R^2$, since $r^2 \ll R^2$, the percentage of light collected will be

(8) $R_d + R_i = \dfrac{1}{2}\left[\left(1 - \dfrac{y}{\sqrt{y^2 + r^2}}\right)\exp[-\alpha_s y] + \left(1 + \dfrac{y}{\sqrt{y^2 + r^2}}\right)\dfrac{\mu' r^2}{4R^2(1 - \mu')}\right]10^2 .$


Making a numerical case, by assuming the values: 


pia LO hake Ss de =0* eras pi=="‘0.935 CINA 
we obtain respectively for the two limit positions y= 0 and y= 2R 


$(R_d + R_i)_{y=0} = (50 + 0.5)\,\%$
(Ri + Ra) 


je 
Such a counter then proves to be completely disuniform. It may be of interest to observe that when the condition $y^2 \gg r^2$ is satisfied, (8) becomes

(9) $R_d + R_i \simeq \dfrac{\mu' r^2}{4R^2(1 - \mu')}\,100\,\% ,$

which does not depend on the position at which the scintillation takes place.
This situation can be realized in practice by placing a suitably shaped scintillator in a convenient region of the sphere with respect to the collection surface, provided that the remaining region of the sphere is filled with a highly transparent medium having the same refractive index as the scintillator. In this case, if the absorption coefficient of this substance is less than $\alpha_s$, $\mu'$ will increase, and so improve the yield.

Should it be desired to examine shapes other than the spherical one with diffusing walls, the situation worsens still further, since the symmetry which exists in the case of the diffusing sphere is lacking. In this connexion the conclusions of M. MANDÒ (13) can be considered valid, according to which the results for Ulbricht's sphere can be extended approximately to other shapes, provided that these are sufficiently regular and that their three maximum dimensions are not very different.
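
The estimate for the diffusing sphere can be tried numerically. The sketch below assumes that the direct fraction is the solid angle of the collection disc as seen from P, attenuated by the scintillator absorption over the distance y, and that the diffused fraction follows the usual integrating-sphere series, reduced diffusion factor × f / (1 − reduced diffusion factor × (1 − f)), with f the ratio of the collection area to the sphere area. The function name and all parameter values are hypothetical; they are not the numerical values used in the paper.

```python
import math


def collected_percent(y, r, R, mu_eff, alpha_s):
    """Return (direct %, diffused %) of the emitted light reaching the disc.

    y       : distance of the scintillation from the collection surface
    r       : radius of the circular collection surface (must be > 0)
    R       : radius of the sphere
    mu_eff  : wall diffusion factor already reduced for absorption
    alpha_s : mean absorption coefficient of the scintillator (1/length)
    """
    cos_a = y / math.hypot(y, r)            # half-angle of the cone subtended by the disc
    direct = 0.5 * (1.0 - cos_a) * math.exp(-alpha_s * y) * 100.0
    f = r * r / (4.0 * R * R)               # disc area / sphere area
    # Geometric series over successive diffusions on the wall.
    sphere_gain = mu_eff * f / (1.0 - mu_eff * (1.0 - f))
    diffused = 0.5 * (1.0 + cos_a) * sphere_gain * 100.0
    return direct, diffused


if __name__ == "__main__":
    for y in (0.0, 5.0, 20.0):              # illustrative positions, same units as r and R
        print(y, collected_percent(y, r=1.0, R=10.0, mu_eff=0.9, alpha_s=0.01))
```

Scanning y from zero to the sphere diameter shows the behaviour described in the text: the direct term falls from one half of the emitted light to a negligible value, while the diffused term changes by no more than a factor of two.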


6. — Experimental Proofs. 


We constructed a parallelepiped-shaped liquid counter, measuring 18 cm × 18 cm × 2 cm, with transparent plexiglass walls of thickness s = 2.0 mm, in which the light was collected on an entire wall (one of those of the smaller area). To this a light pipe (Fig. 8), also of plexiglass, of length l = 17 cm, was applied. To join the rectangular collection face to the photosensitive surface, constituted by the photocathodes of 3 P.M. 5819, the light pipe was suitably cut with two long incisions, h = 13 cm. As scintillator we used a solution of 3 g of terphenyl + 1 litre of phenylcyclohexane + 10 mg of diphenylhexatriene.

(13) M. MANDÒ: Nuovo Cimento, 12, 5 (1954).

On the basis of the considerations discussed above, the disuniformity estimated for the effect of absorption is ~ ± 11% with respect to the mean value. Taking the preceding calculations as a basis, we get a maximum yield of ~ 16% for scintillations near the light pipe and a minimum yield of ~ 13% for those which take place at the maximum distance from it. In calculating the disuniformity due to absorption we supposed the emission spectrum to be of constant intensity at all wavelengths and to be limited between the values λ = 3300 Å and λ = 4700 Å. The absorption coefficients of the solution and of plexiglass were measured directly with a Beckman spectrometer on samples 10 cm thick. The results of these measurements are reported in Fig. 9.

Fig. 9.

Analogous calculations, carried out for the same counter fitted with an aluminium reflecting plate (mean reflection coefficient 0.95) applied to the face opposite the collecting one, gave an estimated disuniformity due to absorption of ~ 7%. The maximum yield in this case was estimated at ~ 20% and the minimum at ~ 17%. The curves in Fig. 10 show the behaviour, in arbitrary units, of the counter's response as a function of the distance of the scintillation. They were calculated, for the two cases considered, taking only absorption into account.

To the disuniformity due to absorption must be added that due to light distribution on the photosensitive surface. Comparison of the g values of our P.M.'s allowed us to estimate a value of ~ 10 for p. From the curves of Fig. 4 we calculated a maximum disuniformity of ~ 8% in the counter without a reflecting plate.

The experimental tests were carried out with the device shown schematically in Fig. 11. A ⁶⁰Co source of 50 millicurie was collimated with a block of Pb 20 cm thick, in which a hole 8 mm in diameter was bored. The counter was explored at several points with the small collimated beam, and the direct current from the P.M.'s was measured with a galvanometer. The currents of each phototube were sufficiently weak that possible saturation and space-charge phenomena could be considered absent. For these reasons we consider that the results obtained remain valid for pulse operation.


Fig. 10. Fig. 11.

Equal-response curves, relative to the two cases treated above, are shown in Fig. 12. We limited the determination of the equal-response curves to a 16 × 16 cm surface of the counter, leaving out a border of 1 cm because, on this edge, the influence of the beam width, which in our case was estimated at ~ 2 cm, made itself felt.

Fig. 12 (panels: reflection; no reflection).

Fig. 10 also reports the maximum and minimum values of the response measured along axes parallel to the collection face, normalized to the point marked by a cross. The response variations along these axes are ascribable solely to the disuniformity of light distribution. The measurements of the maximum variation of the response allow this disuniformity to be estimated at ~ 5% and ~ 4% respectively for the counter without and with a reflecting plate.

The experimental results for the counter without a reflecting plate show a disuniformity of ~ 14%. This value is certainly an underestimate, because the responses on the counter edge near the collection face were not taken into consideration. The calculated disuniformity, on the other hand, is given by the sum of the disuniformity due to absorption (~ 11%) and that due to light distribution (~ 8%). If, however, we take into account the fact that the latter contribution was largely overestimated and can be reduced by a few per cent, we see that the experimental results are in good agreement, within the limits of the approximations made, with the theoretical ones.

In the case of the counter with a reflecting plate we found a disuniformity of ~ 11%, still in good agreement with our predictions. Finally we should like to observe that the increase in yield due to the addition of the reflecting plate is also in good agreement with the calculated increase.


7. — Light Collection in Time. 


It is important to make some observations on the temporal development of light collection in pulse operation, in order to underline the advantages which, in this respect too, a counter with transparent walls presents over those with other types of walls. In a parallelepiped-shaped counter the light collected is due to two contributions: the first from the light which propagates towards the collection face, and the second from the light which goes in the opposite direction and reaches the collection surface after reflection on the opposite surface parallel to it. This second contribution is superimposed on the first, shifted with respect to it by a time which depends on the scintillation position and on the counter dimensions. A rigorous calculation of the collection function in time, in order to estimate the time T necessary for the collected light to reach its maximum value (practically 99%), turns out to be very laborious, even if rather simple from a theoretical point of view.

Appendix III reports the calculation of the collection function in a particular case. The results of this calculation nevertheless allow us in general to establish a lower limit for the collection time, a limit which can be expressed by


(10) $T = n\tau + t_s = n\tau + \dfrac{2a}{v} ,$


where $\tau$ is the decay time of the phosphor, $t_s$ the maximum shift time between the two contributions, $v$ the propagation speed of light in the medium, and $n$ a convenient number depending on the counter dimensions, which in practice ranges from 5 to 10. The value of T, all other conditions being equal, is always smaller in a counter with transparent walls than in one with reflecting or diffusing walls. In fact, in these last cases, a contribution is made to the pulse by light rays which travel a long way before reaching the photosensitive surface, rays which are for the most part eliminated in the case of transparent walls. The voltage pulse at the P.M. anode has the same time dependence as the light collection if the time constant of the integrating output circuit is much greater (theoretically infinite) than T. This condition must be realized when a proportional counter is required.

We propose to take the time 2T as the resolving time of such a counter; T also indicates the pulse formation time.

We wish finally to stress that the estimation of $t_s$ is very important whenever it is necessary to use large counters.
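
A small sketch of the lower limit for the collection time follows. It assumes that the limit is n decay times plus the maximum shift between the two contributions, and that this shift equals a round trip over the counter length at the speed of light in the medium (the geometry used in Appendix III). The decay time, refractive index and value of n below are illustrative assumptions, not values from the paper.

```python
C_VACUUM = 2.998e8  # speed of light in vacuum, m/s


def collection_time(tau, length, refractive_index, n=5):
    """Lower limit of the collection time in seconds.

    tau              : decay time of the phosphor (s)
    length           : counter length along the collection axis (m)
    refractive_index : index of the scintillator (assumed uniform)
    n                : number of decay times needed to reach about 99%
    Assumption: maximum shift time = 2 * length / v.
    """
    v = C_VACUUM / refractive_index       # light speed in the medium
    t_shift = 2.0 * length / v            # direct versus reflected path
    return n * tau + t_shift


if __name__ == "__main__":
    # Hypothetical 3 ns phosphor in an 18 cm counter with index 1.5.
    print(collection_time(tau=3e-9, length=0.18, refractive_index=1.5))
```

For a counter of some decimetres the shift term is of the order of nanoseconds, comparable with the decay time of a fast organic phosphor; this is why the text stresses its importance for large counters.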


8. — Conclusion. 


From what has been said in the preceding sections we can draw some generally valid conclusions concerning the construction of scintillation counters of minimum disuniformity with dimensions of some decimetres.

a) The parallelepiped shape with transparent walls, and with at least one entire face used for collection and connected to the photosensitive surface by a suitably long light pipe, is convenient. With two entire photosensitive faces opposite one another, uniformity as well as yield is improved. An intermediate situation is obtained by substituting a reflecting plate for one photosensitive face.

b) All other conditions being equal, a counter with reflecting walls is always more disuniform than one with transparent walls. A considerable improvement in uniformity can be obtained in this case by using a light pipe with transparent walls. Such a counter gives a performance not far from that of the one with transparent walls. It can prove useful when, for technical reasons, transparent walls cannot be used.


70 — Supplemento al Nuovo Cimento,

c) A counter with diffusing walls can be uniform only if the conditions of Ulbricht's sphere are realized and if the scintillation zone is suitably limited; it must never be too near the photosensitive surface. In this case the counter yield is very low, and lower than that obtained with the preceding arrangements.

d) Even in the best of the solutions proposed, absorption and irregular light distribution on the photosensitive surface always remain as causes of disuniformity. These play roles of different importance according to the dimensions of the counters. For counters which measure some decimetres the first cause prevails over the second.

* * *

Our grateful thanks are due to Mr. A. COLLINA for his construction of the counter described in the present paper.


APPENDIX I. 


We shall carry out the integration of (2) for the two contributions $\Delta R_1$ and $\Delta R_2$, subdividing the integration field in a suitable manner. We can at once see from Fig. 13 that


7/2 2/2 m/2 7/2 y ‘n/2 
| il i ; 
(Aa) AR ==] / ff 0,9apa0-] fd, 8, eavao—/ |ra, 0,0 
g=0 6=0 p=7j2-y 0=0 g=0 6=0'y 
for A(y, A) and 
ee dp 
(A.2) AR, = AR-3 | ee 9, 9) dp dé 
p=0 6=0 


for A(2a—y, A), where it is 


and 6, and 0, are such as to satisfy 


f(A, 6, p)= exp [— A/cos g cos 6] cos p DIS 
the relations ! 


ar 5 
o Vsin’ Vis DSO 


) 


cos 6, 
COs @ 


V sin? y — sin? 9 
COS © ’ 
Given the difficulty of such integrations, they have been performed graphically.


APPENDIX II.


Let a rectangular parallelepiped (Fig. 14) made of transparent material be given; at a point P within it is placed a point source which emits light in all directions. We want to calculate the distribution of the luminous flux which falls on the face lying in the plane π. It is fundamentally a question of estimating the fluxes collected by each of the square surfaces into which the collection face of the counter can be considered divided, each square corresponding to the photocathode of a P.M. (In the case of the figure the number of P.M.'s is three.)

Fig. 14.

The problem can be solved by dividing the plane π into many contiguous squares, equal to those into which the collection surface was divided. When a ray leaves P in a certain direction it will, after successive reflections, fall on one of the squares of the collection face. If the ray is imagined as propagating in a straight line without undergoing reflections, it will fall on the plane π within a square corresponding to that of the collection surface. The ray r in Fig. 14 illustrates this situation.

The quantity of light which falls on the element of area dx dz of the plane π will then be given by

(A.3) $d\mathcal{N} = \dfrac{\mathcal{N}}{4\pi}\,\dfrac{Y\,dx\,dz}{(x^2 + Y^2 + z^2)^{3/2}} ,$

where $\mathcal{N}$ is the total quantity of light emitted by the luminous source, Y the distance of the source from π, and x and z the coordinates of the surface element. The total light collected by one of the squares into which the photosensitive surface was divided is given by

(A.4) $\mathcal{N}_i = \dfrac{\mathcal{N}}{4\pi} \sum_j \int_{x_{1j}}^{x_{2j}}\!\int_{z_{1j}}^{z_{2j}} \dfrac{Y\,dx\,dz}{(x^2 + Y^2 + z^2)^{3/2}} ,$

where $x_{1j}$, $x_{2j}$, $z_{1j}$ and $z_{2j}$ are the coordinates of the vertices of one of the generic squares of the plane π corresponding to the one designated on the collection surface. The four branches of hyperbola, which represent the intersections with the plane π of the four cones issuing from P with axes parallel to the plane π and semi-aperture γ, have the equations

$x^2 \operatorname{tg}^2\gamma = Y^2 + z^2$ and $z^2 \operatorname{tg}^2\gamma = Y^2 + x^2$

in the reference system chosen. They represent the limits of variation of the coordinates $x_j$ and $z_j$, in the sense that in the sum (A.4) only those squares (or portions of them) are to be counted which lie in the region of the plane π between them. For each of the areas into which the collection surface has been divided, and for every value of Y chosen on an axis normal to π, eq. (A.4) can be estimated graphically in a sufficiently rapid manner and with good approximation.
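
The integral over a single square can also be evaluated in closed form rather than graphically. The sketch below uses the standard expression for the solid angle subtended by a rectangle, which is a substitute for the graphical method of the text; it assumes no absorption and a rectangle lying in the unfolded plane at distance Y from the source. The function names are hypothetical.

```python
import math


def _corner_term(x, z, Y):
    # Antiderivative of Y / (x^2 + Y^2 + z^2)^(3/2) with respect to x and z.
    return math.atan2(x * z, Y * math.sqrt(x * x + Y * Y + z * z))


def flux_fraction(x1, x2, z1, z2, Y):
    """Fraction of the total emitted light that falls on the rectangle
    x1 <= x <= x2, z1 <= z <= z2 of a plane at distance Y > 0 from a
    point source located above the origin of that plane."""
    solid_angle = (_corner_term(x2, z2, Y) - _corner_term(x1, z2, Y)
                   - _corner_term(x2, z1, Y) + _corner_term(x1, z1, Y))
    return solid_angle / (4.0 * math.pi)


if __name__ == "__main__":
    # Three adjacent unit squares, source above the centre of the middle one.
    for x1 in (-1.5, -0.5, 0.5):
        print(flux_fraction(x1, x1 + 1.0, -0.5, 0.5, Y=1.0))
```

Summing `flux_fraction` over all the unfolded squares that map onto one photocathode, restricted to the region between the limiting hyperbolas, gives the quantity that the text estimates graphically.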

Since disuniformity in distribution is due mainly to the light which reaches the collection face almost directly, i.e. after few reflections, the graphical computation can be stopped at an area around the centre of the reference system which is not too extended, the rest of the light collected by the face being considered uniformly distributed. Fig. 4 shows the results obtained for the parallelepiped-shaped counter with transparent walls described above.


APPENDIX III.


Our present purpose is to calculate the collection function in time of the photons for a scintillation which takes place at a distance Y from the photosensitive face. We shall refer to a counter with reflecting walls, parallelepiped in shape with one entire photosensitive face, and we shall suppose there to be no absorption. Taking absorption into account does not complicate the calculations in any essential way, but only makes them more laborious. On the other hand these hypotheses still allow us to estimate the lower limit of the value of T in (10), which is the estimate that interests us.

Let us suppose (Fig. 15) that a scintillation takes place at P. Taking as the time origin (all times being measured in units of τ) the instant at which the first photon reaches O, the number of photons which arrive at the photosensitive surface within the elementary solid angle dω up to the time t is given by

$dq = \dfrac{\mathcal{N}}{4\pi}\,[1 - \exp[-(t - t_0)]]\,d\omega ,$

where $t_0$ is the time taken in covering the distance x − Y. The elementary solid angle dω can be expressed by

$d\omega = \dfrac{2\pi\,Y\,\tau v\,dt_0}{(Y + \tau v\,t_0)^2} ,$

where v is the velocity of propagation of light in the medium considered.

Fig. 15.

The total number of photons collected by the photosensitive surface up to the time t will be

(A.5) $q(t) = \dfrac{\mathcal{N}}{4\pi}\int_0^{t} [1 - \exp[-(t - t_0)]]\,\dfrac{2\pi\,Y\,\tau v}{(Y + \tau v\,t_0)^2}\,dt_0 .$

The upper limit t of $t_0$ is fixed by the fact that we must always have $t_0 \le t$.
The integration of (A.5) leads to

(A.6) $q(t) = \dfrac{\mathcal{N}}{2}\,\{1 - e^{-t} - A\,e^{-(A+t)}[\operatorname{Ei}(A + t) - \operatorname{Ei}(A)]\} ,$

where $A = Y/\tau v$.
As can be seen from (A.6), and as has already been said, the collection function depends on Y, the distance of the scintillation from the photosensitive surface.

Eq. (A.6) refers to a single half-space of emission. The contribution due to the other half-space is calculated with a relation analogous to (A.6) in which Y is replaced by 2a − Y, the times being of course counted from the same origin.

For purposes of illustration, we have plotted in Fig. 16 the behaviour of (A.6) for various values of A. As can be seen, the time necessary to reach the maximum depends markedly on A.
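
The dependence on A can be explored numerically without special functions by integrating (A.5) directly. The sketch below assumes the integrand is the exponential build-up of the phosphor weighted by the solid-angle element, written in the reduced variable A (the distance Y divided by the path covered in one decay time), and applies Simpson's rule; the function name and the number of steps are hypothetical choices.

```python
import math


def collected_fraction(t, A, steps=4000):
    """Fraction of the emitted photons collected up to time t from one
    half-space of emission; t is in units of the decay time and A > 0.
    Simpson integration of
        0.5 * (1 - exp(-(t - t0))) * A / (A + t0)**2   for t0 in [0, t].
    """
    if t <= 0.0:
        return 0.0
    if steps % 2:
        steps += 1                        # Simpson's rule needs an even count
    h = t / steps

    def integrand(t0):
        return (1.0 - math.exp(-(t - t0))) * A / (A + t0) ** 2

    total = integrand(0.0) + integrand(t)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    return 0.5 * total * h / 3.0


if __name__ == "__main__":
    for A in (0.1, 1.0, 5.0):             # illustrative values of A
        print(A, [collected_fraction(t, A) for t in (1.0, 5.0, 20.0)])
```

The result tends to one half for long times, since only one half-space is counted, and at a given time it is smaller the larger A is, that is, the farther the scintillation lies from the photosensitive face.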


In the case of transparent walls, collection is without doubt more rapid, since the tail contributions make themselves less felt. When absorption is taken into account, it can be seen that it cuts off the contributions due to the longer paths and so further reduces the slow part of the signal.

We consider therefore that only in particular conditions does the time T reach the full value ~ $10\tau + t_s$.


On the Influence of Magnetic-Field Errors
on the Betatron Oscillations in the Synchrotron
with Strong Focusing (1).


G. LÜDERS


Max-Planck-Institut für Physik - Göttingen
European Organisation for Nuclear Research (CERN)


(received 25 March 1955)


CONTENTS: Introduction. a) The principle of strong focusing. b) Types of field errors and their effects. — I. General treatment. 1. Errors in the magnetic guide field. a) Equation of the perturbed equilibrium orbit. b) Special cases. c) Application to the «synchronous orbit». 2. Errors in the field index. a) Perturbation calculation for errors in the field index. b) Equations for the unstable region. c) Flanks of the unstable regions. 3. Various problems. a) Simultaneous action of errors in the guide field and in the field index. b) Similarity laws. 4. Summary of Part I. — II. Statistical evaluation. 1. Mean values. a) Perturbed equilibrium orbit, preliminary remarks. b) Perturbed equilibrium orbit, neglecting the variation of amplitude along the circumference. c) Perturbed equilibrium orbit, variation of amplitude along the circumference. d) Unstable regions. 2. Probability distributions. a) Perturbed equilibrium orbit. b) Unstable regions. 3. Summary of Part II. — Appendix 1: Equations of motion in linear approximation. a) Neglecting the acceleration. b) Taking the acceleration into account. — Appendix 2: On the perturbation treatment of small changes of the field index. a) The interior of a stable region. b) The boundary of a stable region. c) Unstable regions due to errors in the field index.


(1) This paper is essentially a summary of research reports prepared by the author for the European Organisation for Nuclear Research (CERN) up to the autumn of 1953. The investigations were carried out at the Institute for Theoretical Physics of the University of Copenhagen, in a full-time capacity for the Theoretical Study Group of CERN, and at the Max-Planck-

1076 G. LUDERS 


Introduction.


a) The principle of strong stabilization. The synchrotron (2) is an accelerator in which the charged particles, e.g. protons, are guided by a magnetic guide field in the neighbourhood of a closed orbit. The particles pass through alternating electric fields and, if their phase relative to these fields is correct, continually gain energy during an acceleration cycle. One particular particle orbit in the synchrotron is called the equilibrium orbit; it is a closed, nearly circular curve and, in the usual designs, lies in the horizontal plane. The magnetic guide field at the equilibrium orbit is then vertical; this direction will be called the z-direction in what follows (the possibility of a field component parallel to the equilibrium orbit is disregarded). In order that the equilibrium orbit be a possible orbit for a particle of charge $e$ and momentum $p$, the magnetic field at a given point of the equilibrium orbit must be related to the radius of curvature $\varrho$ at that point by


(E.1)    $H_z = \dfrac{c\,p}{e\,\varrho}$


($c$ = velocity of light).

As a result of the acceleration, the velocity and momentum of the particles increase. The frequency of the accelerating voltage must be chosen inversely proportional to the revolution time; during the acceleration cycle it must therefore increase monotonically. In order that a particle starting on the equilibrium orbit remain on it throughout the acceleration cycle, the magnetic field must be matched to the instantaneous momentum according to Eq. (E.1) and must itself increase monotonically.

Under ideal operating conditions, a particle that starts on the equilibrium orbit with the correct initial momentum and the correct phase relative to the accelerating voltage stays on the equilibrium orbit throughout the acceleration cycle. This particle will be called the equilibrium particle, and its instantaneous energy the equilibrium energy. For successful operation of the synchrotron it is essential that particles which start with somewhat different initial conditions (momentum, phase and geometric


Institut für Physik, Göttingen, in part-time work for the Proton Synchrotron Group. Submission for publication was delayed by external circumstances. Minor additions were made after the autumn of 1953; these were worked out by the author as a member of the Proton Synchrotron Group in Geneva.

(2) V. VEKSLER: Journ. Phys. USSR, 9, 153 (1945); E. M. MCMILLAN: Phys. Rev., 68, 143 (1945).

initial conditions), or which are later disturbed by collisions with the gas or by deviations from the ideal operating conditions, also remain in the neighbourhood of the equilibrium particle and likewise pass successfully through the acceleration process. This poses the problem of stability: particle orbits with deviating initial conditions must perform oscillatory motions about the equilibrium particle. The oscillations about the equilibrium orbit of particles with the correct energy and phase, to which the present paper is essentially restricted, are called betatron oscillations, since they occur in a similar way in the betatron. Particles whose energy differs from the instantaneous equilibrium energy and whose phase differs from the correct one describe other orbits in the synchrotron; the associated lengthening or shortening of the path leads to oscillations about the equilibrium energy and the equilibrium phase. Because of this phase stability it is only necessary to match the instantaneous frequency and the magnetic field to each other; provided the peak value of the accelerating voltage is large enough, the stable phase then establishes itself automatically (3).

Stability of the betatron oscillations, i.e. an oscillatory motion of the particles about the equilibrium orbit, is achieved by choosing the magnetic guide field to be inhomogeneous. Let $z$ denote the vertical displacement of a particle and $r$ the horizontal displacement perpendicular to the equilibrium orbit (directed away from the centre of the accelerator), and let the arc length along the equilibrium orbit be measured in a unit $\vartheta$ that increases from $0$ to $2\pi$ during one revolution. Then (neglecting the accelerating voltage) the following equations of motion hold in linear approximation:


(E.2)    $z''(\vartheta) - n_{zz}(\vartheta)\,z(\vartheta) - n_{zr}(\vartheta)\,r(\vartheta) = 0$,
         $r''(\vartheta) - n_{rz}(\vartheta)\,z(\vartheta) - n_{rr}(\vartheta)\,r(\vartheta) = 0$.


Fig. 1. Coordinates $r$ and $z$ measured from the equilibrium orbit. Pole pieces and magnetic lines of force are drawn in.

The primes denote derivatives with respect to $\vartheta$. The equations are derived and explained in detail in App. 1. Here (with $2\pi R$ as the circumference


(3) Cf. D. BOHM and L. FOLDY: Phys. Rev., 70, 249 (1946).

of the equilibrium orbit)


(E.3)


That only derivatives of the z-component of the magnetic field occur here is due to the fact that the vertical and horizontal components of the magnetic field are connected with each other by Maxwell's equations. For simplicity it will further be assumed that


(E.4)    $n_{zr}(\vartheta) = n_{rz}(\vartheta) = 0$,


which holds, for example, if the magnetic field is symmetric with respect to the plane of the equilibrium orbit. The betatron oscillations in the vertical and horizontal directions are then independent of each other.

In the older stabilization principle (4) the equilibrium orbit is chosen to be a circle ($\varrho(\vartheta) = R$) and $\partial H_z/\partial r$ is chosen constant along the circumference. The equations of motion (E.2) then become ordinary oscillator equations, and both the vertical and the horizontal betatron oscillations are oscillatory if


(E.5)    $0 < -\dfrac{R}{H_z}\,\dfrac{\partial H_z}{\partial r} < 1$.


The wavelength of the betatron oscillations (betatron wavelength for short) is then larger than one revolution for both directions of oscillation. The new «principle of strong stabilization» (6), proposed by CHRISTOPHILOS, COURANT, LIVINGSTON and SNYDER (5), in contrast allows in principle an arbitrarily large number of betatron waves on the circumference; the particle beam is thereby held together more tightly and, for the same properties of the injector, has a smaller cross-section (7).

(4) D. W. KERST and R. SERBER: Phys. Rev., 60, 53 (1941).

(5) E. D. COURANT, M. S. LIVINGSTON and H. S. SNYDER: Phys. Rev., 88, 1190 (1952), referred to below as CLS. On the question of priority cf. E. D. COURANT, M. S. LIVINGSTON, H. S. SNYDER and J. P. BLEWETT: Phys. Rev., 91, 202 (1953).

(6) The author prefers not to follow the English usage of calling the principle the «strong focusing principle», since this expression (or the corresponding German translation) is too suggestive of optical imaging properties. The more recently used expression «alternating gradient principle» does not admit a simple German translation.
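As a numerical illustration of the older stabilization condition (E.5), the following sketch assumes the standard constant-gradient result that, for the conventional field index $-(R/H_z)\,\partial H_z/\partial r$, the numbers of vertical and horizontal betatron oscillations per revolution are the square roots of that index and of one minus that index. These expressions and the function name `weak_focusing_tunes` are assumptions brought in for illustration; they are not formulas quoted from this paper.

```python
import math

def weak_focusing_tunes(n_conv):
    """Return (Q_z, Q_r), betatron oscillations per turn, for a constant-gradient ring.

    n_conv is the conventional field index -(R/H_z) dH_z/dr (an assumption of
    this sketch; the paper's own n(theta) is defined differently).
    Returns None outside the stable range 0 < n_conv < 1 of condition (E.5).
    """
    if not 0.0 < n_conv < 1.0:
        return None  # at least one of the two motions is not oscillatory
    q_z = math.sqrt(n_conv)        # vertical motion:   z'' + n_conv * z = 0
    q_r = math.sqrt(1.0 - n_conv)  # horizontal motion: r'' + (1 - n_conv) * r = 0
    return q_z, q_r
```

Inside the stable range both values are smaller than one, which is the statement that each betatron wavelength is longer than one revolution.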

For the mathematical formulation of the principle of strong stabilization, the decoupled equations of motion (E.2) (cf. Eq. (E.4)) will be written in simplified form. Only one of the two directions of oscillation, say the horizontal one, is considered explicitly; its equation of motion is written


(E.6)    $r''(\vartheta) - n(\vartheta)\,r(\vartheta) = 0$.


The quantity $n(\vartheta) = n_{rr}(\vartheta)$ will be called the field index (8). The principle of strong stabilization then consists in choosing the field index $n(\vartheta)$ as a periodic function along the equilibrium orbit that takes both positive and negative values:


(E.7)    $n(\vartheta + \vartheta_0) = n(\vartheta)$.


The period $\vartheta_0$ must be an integral fraction of a full revolution ($2\pi$):


(E.8)    $\vartheta_0 = \dfrac{2\pi}{M}$.


The (integer) quantity $M$ will be called the number of periods of the field index on the circumference. If $\varrho(\vartheta)$ is periodic with the same period, it follows from Eq. (E.3) that the vertical oscillation is governed by a field index of the same period which is essentially the negative of the one occurring in Eq. (E.6). Although the mathematical analysis will be restricted to the horizontal


(7) In quantitative considerations it must be borne in mind that, for injection during one revolution only, the injector is not characterized solely by the angular aperture of the beam injected into the synchrotron. Rather, joining the pre-accelerator to the synchrotron poses a problem very similar to that of the reflection-free connection of two waveguides. By using electric or magnetic lenses an optimum match can always be achieved, for which the amplitude of the betatron oscillations excited at injection is smallest.

(8) This definition does not agree with common usage, but it did not seem advisable to introduce a new name for the quantity $n(\vartheta)$ occurring in this paper. The quantity usually called the field index is essentially identical with one of the coefficients defined in Eq. (E.3); often a factor $(R/\varrho(\vartheta))^2$ is split off as well, which is not advisable for the general theory presented here. In applying the results of this paper, particular attention should be paid to the differences between the definitions of the field index.

direction of oscillation, it must of course be kept in mind that stability of both the horizontal and the vertical oscillations is required.

Given any two solutions $r_1(\vartheta)$, $r_2(\vartheta)$ of the differential equation (E.6), the Wronskian determinant formed from them,


(E.9)    $W = r_1(\vartheta)\,r_2'(\vartheta) - r_1'(\vartheta)\,r_2(\vartheta)$,


is independent of $\vartheta$; it is different from zero if the two solutions are linearly independent. Furthermore, it is well known that the periodicity (E.7) of the field index $n(\vartheta)$, which appears as a coefficient in the differential equation, is reflected in a kind of periodicity of the solutions (9). In general one can find two special linearly independent solutions $r_1(\vartheta)$, $r_2(\vartheta)$ («Floquet solutions») with the properties


(E.10)    $r_1(\vartheta + \vartheta_0) = \lambda_1\,r_1(\vartheta)$,    $r_2(\vartheta + \vartheta_0) = \lambda_2\,r_2(\vartheta)$.


The factors $\lambda_1$, $\lambda_2$ are independent of $\vartheta$. Since the differential equation is real, the two factors are either both real or complex conjugates of each other. From the constancy of the Wronskian it follows that the product $\lambda_1\lambda_2$ is always equal to one. In the complex $\lambda$-plane the two multipliers therefore lie either on the real axis or on the unit circle (Fig. 2). In the first case, for $\lambda_1, \lambda_2 \neq \pm 1$, there is besides a decaying solution also one that grows beyond all bounds over successive revolutions (formally, i.e. within the linear approximation). An arbitrary solution or, in physical terms, an arbitrary particle orbit is a linear combination of these two special solutions. If the image points lie on the real axis, there are therefore, for $\lambda_1, \lambda_2 \neq \pm 1$, orbits whose amplitude (formally) grows beyond all bounds; one then has instability. In the case of complex multipliers $\lambda_1$, $\lambda_2$ on the unit circle, on the other hand, the amplitude of the betatron oscillations remains bounded even after arbitrarily many revolutions; one has stability. Writing


(E.11)    $\lambda_1 = \exp[i\mu]$,    $\lambda_2 = \exp[-i\mu]$,


one has stability for real $\mu$. If $\mu$ is made unique, say by the prescription $0 < \mu < \pi$, the quantity $2\pi\vartheta_0/\mu$ is called the betatron wavelength (measured in angular units). Using $M$ from Eq. (E.8),


(E.12)    $Q = \dfrac{M\mu}{2\pi}$


gives the number of betatron wavelengths in one revolution.

Fig. 2. Complex plane of the Floquet multipliers $\lambda_1$, $\lambda_2$; cf. Eq. (E.10).

(9) So-called Hill differential equation; cf. e.g. E. KAMKE: Differentialgleichungen, Lösungen und Lösungsmethoden, Vol. 1, 410, Leipzig, 1943. Further references are given there.
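The relations (E.10) to (E.12) can be made concrete with a short numerical sketch. It assumes that the motion over one period $\vartheta_0$ is described by a 2x2 matrix of unit determinant acting on $(r, r')$, so that the two multipliers are the roots of $\lambda^2 - 2c\lambda + 1 = 0$ with $c = \cos\mu$ equal to half the trace of that matrix. The matrix picture and the names `floquet_multipliers` and `betatron_number` are illustrative assumptions, not notation used in the paper.

```python
import cmath
import math

def floquet_multipliers(half_trace):
    """Roots of lambda**2 - 2*half_trace*lambda + 1 = 0; their product is 1."""
    root = cmath.sqrt(half_trace * half_trace - 1.0)
    return half_trace + root, half_trace - root

def betatron_number(half_trace, periods):
    """Q = M*mu/(2*pi) with cos(mu) = half_trace and 0 < mu < pi.

    `periods` is M, the number of field-index periods per revolution.
    Returns None when the multipliers are real (boundary or instability).
    """
    if abs(half_trace) >= 1.0:
        return None
    mu = math.acos(half_trace)
    return periods * mu / (2.0 * math.pi)
```

For `half_trace` between -1 and 1 the two roots are complex conjugates on the unit circle (stability); outside that interval they are real and reciprocal to each other, so one of them exceeds one in magnitude.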

With the aid of Fig. 2 one can easily discuss the effect on stability of a continuous change in the shape of the function $n(\vartheta)$ within one periodicity interval. Suppose that initially there is stability. As the field index is changed continuously, the image points move continuously along the unit circle in some definite way, always remaining mirror images of each other with respect to the real axis (10). Eventually both image points reach the point $+1$ or $-1$. Here the locus branches, and a further change of the distribution of the field index will in general make the image points first move some distance along the real axis, until they turn back and again move on the unit circle. The stable region is thus interrupted by an unstable region.

At the boundaries of the stable region either $\lambda = +1$ or $\lambda = -1$. There is then at least one solution satisfying Eq. (E.10); it is periodic (for $\lambda = +1$) or antiperiodic (for $\lambda = -1$). If at this point there is also a boundary of the stable region with respect to a change of the field index by multiplication with a constant factor, then there is only a single periodic (or antiperiodic) solution,


(E.13)    $r_{\mathrm{per}}(\vartheta + \vartheta_0) = \pm\,r_{\mathrm{per}}(\vartheta)$,

and for all other solutions


(E.14)    $r(\vartheta + \vartheta_0) = \pm\,\bigl(r(\vartheta) + a\,r_{\mathrm{per}}(\vartheta)\bigr)$


with a constant $a$ depending on the particular solution. If the stated change of the field index does not produce a boundary of the stable region, all solutions are periodic or antiperiodic, respectively. The mathematical proof of these statements can be found in App. 2b.

A particularly simple model of a synchrotron with strong stabilization, which admittedly cannot be realized strictly because of the unavoidable magnetic stray fields, is obtained by dividing each period of the field index into two pieces of equal length with field indices of equal magnitude but opposite sign. If the magnitude of the field index is chosen large compared with one, the equations of motion for the two directions of oscillation become nearly identical; stability of, say, the horizontal oscillations then also ensures stability of the vertical oscillations. Quantitatively, in this case one obtains for $\mu$, according to CLS, the equation


(E.15)    $\cos\mu = \cos\dfrac{\pi\,n^{1/2}}{M}\,\cosh\dfrac{\pi\,n^{1/2}}{M}$.


Here $n$ is the absolute value of the field index. One has stability if the magnitude of the right-hand side remains smaller than one. The characteristic parameter appearing in the stability condition is the quantity $n^{1/2}/M$.

(10) If the change of the field index consists in multiplication by a constant factor, the image points move monotonically with this factor, i.e. without reversal of direction, as will be shown in Appendix 2a.
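Equation (E.15) can be checked numerically against Eq. (E.6). The sketch below assumes the idealized model just described, a field index equal to $+n$ on one half of each period and $-n$ on the other, each half covering the angle $\pi/M$. It multiplies the two sector matrices and compares half the trace of the product with the closed form. The helper names are hypothetical and the matrix formulation is an assumption of this illustration.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested tuples."""
    return (
        (a[0][0]*b[0][0] + a[0][1]*b[1][0], a[0][0]*b[0][1] + a[0][1]*b[1][1]),
        (a[1][0]*b[0][0] + a[1][1]*b[1][0], a[1][0]*b[0][1] + a[1][1]*b[1][1]),
    )

def sector_matrix(index, length):
    """Matrix taking (r, r') across an angle `length` for r'' - index*r = 0 (index constant, nonzero)."""
    k = math.sqrt(abs(index))
    phi = k * length
    if index < 0:  # restoring force: ordinary oscillation
        return ((math.cos(phi), math.sin(phi)/k), (-k*math.sin(phi), math.cos(phi)))
    # index > 0: repelling force, hyperbolic solutions
    return ((math.cosh(phi), math.sinh(phi)/k), (k*math.sinh(phi), math.cosh(phi)))

def cos_mu_matrix(n, periods):
    """cos(mu) as half the trace of the one-period matrix."""
    half_period = math.pi / periods
    m = mat_mul(sector_matrix(+n, half_period), sector_matrix(-n, half_period))
    return 0.5 * (m[0][0] + m[1][1])

def cos_mu_closed_form(n, periods):
    """Right-hand side of Eq. (E.15)."""
    phi = math.pi * math.sqrt(n) / periods
    return math.cos(phi) * math.cosh(phi)

def is_stable(n, periods):
    return abs(cos_mu_closed_form(n, periods)) < 1.0
```

With $M = 20$, for example, $n = 100$ gives $\pi n^{1/2}/M = \pi/2$ and lies inside the stable region, whereas $n = 400$ gives $\pi n^{1/2}/M = \pi$ and lies outside it.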

Variants of this design are easily given. For example, the field indices of successive sectors may still be chosen with opposite signs but with different magnitudes. The corresponding modifications of Eq. (E.15) have been given by CLS. It is then necessary to pay particular attention to the stability of both directions of oscillation. The stability diagram given by CLS is reproduced (for the lowest stability region) in Fig. 3. The meaning of the lines drawn in its interior will be explained in subsection b in connection with the discussion of field errors. In a corresponding way, various authors have calculated the stability conditions for sectors of different lengths and with field-free sections ($1/\varrho = 0$, $n = 0$) inserted between the field-carrying sectors.


b) Types of field errors and their effects. In the construction of a particle accelerator, unavoidable deviations from the ideal design occur. This paper investigates how these deviations affect the stability of the betatron oscillations and under what circumstances they can endanger this stability. The two types of errors to be considered are


1. errors of the magnetic guide field at the position of the ideal equilibrium orbit (11),


2. errors in the field index.

In the linear approximation of the equations of motion a further error can arise from Eq. (E.4) not being strictly satisfied (12). It will not be treated in this paper.

Fig. 3. Stability diagram according to COURANT, LIVINGSTON and SNYDER for an arrangement in which each periodicity interval consists of two sectors of equal length with field indices $+n_1$ and $-n_2$ (limiting case of large $M$). On the dashed lines one has resonances with integral or half-integral values of $Q$ for the vertical or horizontal direction of oscillation (drawn for $M = 6$). Instability curves for coupling resonances are not drawn.

Possible causes of errors in the guide field include remanent fields and fields due to eddy currents (especially at the beginning of the acceleration cycle), as well as errors in the positioning of the individual magnets in the radial and vertical directions. Because of the inhomogeneous character of the magnetic field, such positioning errors change the guide field on the original equilibrium orbit. Causes of errors in the field index include differing behaviour of the individual magnets, especially at saturation, differing lengths, and errors in the positioning of the magnets along the equilibrium orbit.

It is characteristic of the synchrotron that the particles to be accelerated are led many times around the neighbourhood of a closed orbit and that the same error acts on the particles again and again. The strength of the effect of the errors will depend entirely on how the phase of the betatron oscillations changes during one revolution, i.e. between two actions of the same error. Such errors are far more significant for the new «principle of strong stabilization» than for the older stabilization principle, because here a large number of betatron waves can lie on the circumference, so that even a small change of the operating parameters can lead to a substantial change in the oscillation phase difference between two revolutions. It will turn out that critical operating conditions do indeed exist. Their existence leads to limits within which the operating parameters must be kept constant. If this constancy cannot be achieved, other means must be sought to avoid the critical effect of the errors.

Mathematically, field errors lead to additional terms in the equation of motion (E.6). Errors in the guide field will be considered first. The case is particularly simple when the error is caused by faulty positioning of the magnets in the radial direction (the vertical oscillations are influenced correspondingly by positioning errors in the vertical direction). If the magnet producing the guide field at the position $\vartheta$ is displaced from its ideal position by an amount $\delta\xi(\vartheta)$ radially, perpendicular to the equilibrium orbit, the centre of the restoring (or repelling) forces at this position lies at $r = \delta\xi(\vartheta)$ instead of at $r = 0$. Instead of Eq. (E.6) one therefore now has the equation of motion


(E.16)    $r''(\vartheta) - n(\vartheta)\,\bigl(r(\vartheta) - \delta\xi(\vartheta)\bigr) = 0$

or

(E.17)    $r''(\vartheta) - n(\vartheta)\,r(\vartheta) = -\,n(\vartheta)\,\delta\xi(\vartheta)$,


i.e. an inhomogeneous differential equation, which can be solved in closed form if two linearly independent solutions of the associated homogeneous equation are known (see Part I, Sect. 1a). If, somewhat more generally, the error in the magnetic guide field is given by (13)


(E.18)    $\delta H_z(\vartheta) = H_{z,\mathrm{real}}(\vartheta) - H_{z,\mathrm{ideal}}(\vartheta)$,


it follows from the derivation of the equations of motion given in App. 1 that Eq. (E.6) is to be replaced by


(E.19)    $r''(\vartheta) - n(\vartheta)\,r(\vartheta) = \delta f(\vartheta)$

with

(E.20)    $\delta f(\vartheta) = -\dfrac{R^2}{\varrho(\vartheta)}\,\dfrac{\delta H_z(\vartheta)}{H_z(\vartheta)}$


($H_z(\vartheta) = H_{z,\mathrm{ideal}}(\vartheta)$). Correspondingly, let an error in the field index be given by


(E.21)    $\delta n(\vartheta) = n_{\mathrm{real}}(\vartheta) - n_{\mathrm{ideal}}(\vartheta)$.

The modified equation of motion (E.6) then reads

(E.22)    $r''(\vartheta) - \bigl(n(\vartheta) + \delta n(\vartheta)\bigr)\,r(\vartheta) = 0$


($n(\vartheta) = n_{\mathrm{ideal}}(\vartheta)$). One thus obtains a modified homogeneous differential equation. The general behaviour of its solutions will be discussed further below in this

(11) J. D. LAWSON (unpublished CERN report) was the first to point out that such errors, in particular those arising from positioning errors of the magnets, can make operation of the synchrotron impossible if an integral number of betatron wavelengths lies on the circumference.

(12) This error, which can lead to a resonance-like coupling between vertical and horizontal oscillations, was first pointed out by P. A. STURROCK (unpublished CERN report).

(13) $H_{z,\mathrm{ideal}}(\vartheta)$ is defined by Eq. (E.1); $H_{z,\mathrm{real}}(\vartheta)$, on the other hand, is the value of $H_z(\vartheta)$ at the position of the ideal equilibrium orbit in the real synchrotron, which is subject to errors.

section. For the explicit solution an approximation method will be used later (Part I, Sect. 2). In general, of course, both errors will occur simultaneously, together with the coupling error omitted here. For methodological reasons it is nevertheless convenient to investigate the two types of errors one after the other at first. The simultaneous action of both errors will be discussed in Part I, Sect. 3a.

A qualitative discussion of the effect of both types of errors will now be given. For errors in the magnetic guide field (Eq. (E.19)) a mechanical analogy can be used. For this purpose one disregards the actual characteristic feature of the principle of strong stabilization, namely the dependence of the coefficient $n$ on $\vartheta$, and considers instead oscillations in which the coefficient in the restoring force is constant. A system performing such oscillations is, for example, a suspended coil spring with a weight attached. An equation of type (E.19) then describes oscillations of this system under the action of an external force; in Eq. (E.17) in particular, $\delta\xi(\vartheta)$ can be interpreted directly as an external displacement of the point of suspension of the spring. The periodic passage through the same errors again and again corresponds in the model to a periodic external force or external displacement. Now it is well known that the oscillation of the spring under the action of an external (not necessarily sinusoidal) force or displacement can be represented as the superposition of a free oscillation and a forced oscillation with the same period as the excitation. The amplitude of the forced oscillation tends (formally) to infinity when the period of the (in general non-sinusoidal!) excitation approaches the period of the free oscillation or a multiple of it. Translated back into the language of betatron oscillations, this means that every particle orbit under the influence of errors in the guide field can be regarded as the superposition of a free betatron oscillation and an orbit that closes on itself after one revolution; this closed orbit can be understood as the modified equilibrium orbit and takes the place of the ideal equilibrium orbit of the error-free synchrotron. If the number of betatron wavelengths on the circumference (the quantity $Q$ of Eq. (E.12)) approaches an integer, the deviation of the modified from the ideal equilibrium orbit tends (formally) to infinity. One then has a resonance-like drift of the whole beam.
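The spring analogy can be turned into a small numerical experiment. The sketch assumes the constant-coefficient model $r'' + q^2 r = \delta f(\vartheta)$, i.e. Eq. (E.19) with $n = -q^2$, where $q$ plays the role of $Q$. For this model the $2\pi$-periodic solution can be written as $r(\vartheta) = \frac{1}{2q\sin\pi q}\int_0^{2\pi}\delta f(\vartheta - u)\cos\bigl(q(\pi - u)\bigr)\,\mathrm{d}u$. This closed form and the function name `closed_orbit` are assumptions of the illustration; the integral is evaluated with Simpson's rule.

```python
import math

def closed_orbit(force, q, theta, steps=2000):
    """2*pi-periodic solution of r'' + q**2 * r = force(theta), evaluated at theta.

    `force` must be a 2*pi-periodic function; q must not be an integer.
    Uses Simpson's rule for the integral over one revolution.
    """
    if steps % 2:
        steps += 1  # Simpson's rule needs an even number of intervals
    h = 2.0 * math.pi / steps
    total = 0.0
    for i in range(steps + 1):
        u = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * force(theta - u) * math.cos(q * (math.pi - u))
    return (h / 3.0) * total / (2.0 * q * math.sin(math.pi * q))
```

For a single harmonic `force = lambda t: math.cos(3 * t)` the result agrees with $\cos(3\vartheta)/(q^2 - 9)$, so the closed orbit grows without bound as $q$ approaches the integer 3, which is the resonance described above.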

The effect of errors in the field index can also be made clear by the example of the oscillating spring. Errors in the field index, i.e. in the restoring force, correspond to a periodic change of the spring constant, e.g. a brief softening of the spring each time. More general and more direct is a mathematical discussion of Eq. (E.22) with the tools provided in subsection a. The modified field index $n(\vartheta) + \delta n(\vartheta)$ is in general no longer periodic with the period $\vartheta_0$, but it is still periodic with $2\pi$ (= one revolution). Even in the presence of errors in the field index the differential equation for the betatron oscillations remains of Floquet type, only the fundamental period is now $2\pi$. The destruction of the period $\vartheta_0$ by the errors has the consequence that unstable regions can appear when the betatron oscillations are periodic or antiperiodic over one revolution, i.e. when the quantity $Q$ of Eq. (E.12) is integral or half-integral (14). If the error $\delta n(\vartheta)$ is small, these unstable regions will be narrow, i.e. they are swept through completely by a relatively small change of the operating parameters; moreover, they will lie close to those operating conditions for which, in the unperturbed accelerator (15), an integral or half-integral number of betatron oscillations lies on the circumference. The instability manifests itself in a monotonic broadening of the particle beam.
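A minimal sketch of how a small field-index error opens a narrow unstable band: it assumes the constant-coefficient model $r'' + q^2 r = 0$ with a single localized gradient error per revolution, idealized as a thin kick $r' \to r' - \varepsilon r$. The one-turn matrix then has half trace $\cos 2\pi q - (\varepsilon/2q)\sin 2\pi q$, and the motion is unstable where its magnitude exceeds one. The thin-kick idealization, the parameter `eps` and the function names are assumptions of this illustration, not the perturbation method of Part I.

```python
import math

def one_turn_half_trace(q, eps):
    """Half trace of the one-turn matrix: rotation by 2*pi*q followed by a thin kick r' -> r' - eps*r."""
    phi = 2.0 * math.pi * q
    return math.cos(phi) - 0.5 * (eps / q) * math.sin(phi)

def is_unstable(q, eps):
    """True if the Floquet multipliers for one revolution are real and unequal."""
    return abs(one_turn_half_trace(q, eps)) > 1.0
```

With `eps = 0.01` the value `q = 0.499` is unstable while `q = 0.501` and `q = 0.3` are stable, so the unstable band is narrow and lies next to the half-integral value; a similar band appears next to `q = 1`. Without the error (`eps = 0`) these values are stable.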

According to subsection a, when a stability region of the ideal synchrotron is traversed monotonically, the form of $r_1(\vartheta)$ or $r_2(\vartheta)$ (Eq. (E.10)) changes from a function periodic in $\vartheta_0$ to one antiperiodic in $\vartheta_0$ (or vice versa). With $M$ (cf. Eq. (E.8)) as the number of periods of the field index on the circumference, during such a traversal of the stability region the case that the solutions are periodic or antiperiodic over a full revolution occurs $(M-1)$ times (not counting the boundaries of the stability region). Each time, errors in the field index can make themselves felt through new unstable regions; every second such critical point (counted from that edge of the stability region at which one has a periodic solution) also leads to a resonance-like effect of the errors of the guide field, i.e. to an infinite growth of the amplitude of the modified equilibrium orbit. In a two-dimensional diagram of the kind shown in Fig. 3 one obtains corresponding critical curves, one family of which gives critical operating conditions for the vertical oscillations and the other for the horizontal oscillations (dashed lines in the figure). Critical curves for coupling resonances are not drawn.

The present paper is divided into two parts. In Part I the mathematical methods for treating the errors are developed and general closed formulae for their effects are given. In Part II these formulae are evaluated under statistical assumptions about the errors, and the resulting probability distributions are investigated. In the planning of an accelerator it is essentially such statistical statements that are of interest (16).


(14) The name «stopbands» has been proposed for these unstable regions.

(15) The splitting of the actual field index into an ideal part (i.e. periodic with $\vartheta_0$) and a perturbation is of course not unique. The results are, however, not affected by this.


I. General Treatment.


1. Errors in the magnetic guide field.


a) Equation of the modified equilibrium orbit (17). The mathematical problem of the equilibrium orbit modified by errors in the guide field consists in finding those solutions of Eq. (E.19) that are periodic with the period $2\pi$ (one revolution):


(I.1.1)    $r(\vartheta + 2\pi) = r(\vartheta)$.


This is possible in closed form, because the general solution of Eq. (E.19) can be written down if two linearly independent solutions $r_1(\vartheta)$, $r_2(\vartheta)$ (and hence the complete solution) of the associated homogeneous equation (Eq. (E.6)) are known. The general solution of the inhomogeneous equation is then given by


(I.1.2)    $r(\vartheta) = a_1 r_1(\vartheta) + a_2 r_2(\vartheta) + \dfrac{1}{W}\displaystyle\int_0^{\vartheta}\mathrm{d}\vartheta'\,\delta f(\vartheta')\,\bigl\{r_1(\vartheta')\,r_2(\vartheta) - r_2(\vartheta')\,r_1(\vartheta)\bigr\}$.


For the derivative it follows that


(I.1.3)    $r'(\vartheta) = a_1 r_1'(\vartheta) + a_2 r_2'(\vartheta) + \dfrac{1}{W}\displaystyle\int_0^{\vartheta}\mathrm{d}\vartheta'\,\delta f(\vartheta')\,\bigl\{r_1(\vartheta')\,r_2'(\vartheta) - r_2(\vartheta')\,r_1'(\vartheta)\bigr\}$.


(16) Investigations similar to those presented here have been carried out independently at various places. Hardly any of this has been published; the unpublished reports were only partly accessible to the author. An appraisal of all parallel developments was not possible and was not intended.

(17) The significance of the existence of a modified equilibrium orbit in the presence of errors in the guide field became clear to the author at the end of 1952; at that time a series of discussions took place in Copenhagen in connection with the critical remarks of LAWSON (cf. footnote (11)), in particular with Mr. T. SIGURGEIRSSON. Calculations and statistical estimates of the modified equilibrium orbit were carried out independently for a simplified mathematical model (constant but large $n$) by, among others, E. D. COURANT and E. BODENSTEDT.

The quantity $W$ in these equations is the Wronskian determinant defined by (E.9). The quantities $a_1$ and $a_2$ are arbitrary constants.

It is convenient to use for $r_1(\vartheta)$ and $r_2(\vartheta)$ the Floquet solutions defined by Eq. (E.10), which have simple periodicity properties. Within a stability region, i.e. for real $\mu$ (Eq. (E.11)), they can be chosen complex conjugate to each other; the Wronskian is then purely imaginary (18). For an actual particle orbit, i.e. real $r(\vartheta)$, the constants $a_1$ and $a_2$ in Eqs. (I.1.2), (I.1.3) are then complex conjugate to each other. Because of Eqs. (E.11), (E.8) and (E.12), one has for a full revolution


(I.1.4)    $r_1(\vartheta + 2\pi) = \exp[2\pi i Q]\,r_1(\vartheta)$,
           $r_2(\vartheta + 2\pi) = \exp[-2\pi i Q]\,r_2(\vartheta)$.

The periodicity condition Eq. (I.1.1) is equivalent to the two conditions

(I.1.5)    $r(2\pi) = r(0)$,    $r'(2\pi) = r'(0)$.

By Eqs. (I.1.2), (I.1.3) they read


(I.1.6)    $a_1\,(1 - \exp[-2\pi i Q])\,r_1(2\pi) + a_2\,(1 - \exp[+2\pi i Q])\,r_2(2\pi) + \dfrac{1}{W}\displaystyle\int_0^{2\pi}\mathrm{d}\vartheta\,\delta f(\vartheta)\,\bigl\{r_1(\vartheta)\,r_2(2\pi) - r_2(\vartheta)\,r_1(2\pi)\bigr\} = 0$,
           $a_1\,(1 - \exp[-2\pi i Q])\,r_1'(2\pi) + a_2\,(1 - \exp[+2\pi i Q])\,r_2'(2\pi) + \dfrac{1}{W}\displaystyle\int_0^{2\pi}\mathrm{d}\vartheta\,\delta f(\vartheta)\,\bigl\{r_1(\vartheta)\,r_2'(2\pi) - r_2(\vartheta)\,r_1'(2\pi)\bigr\} = 0$.


This system of equations is to be solved for the constants $a_1$ and $a_2$; (I.1.2) then represents the equation of the modified equilibrium orbit. It is, however, convenient to regard Eq. (I.1.6) not as an inhomogeneous system for $a_1$ and $a_2$ but as a homogeneous system for the factors multiplying $r_1(2\pi)$ or $r_1'(2\pi)$ and $r_2(2\pi)$ or $r_2'(2\pi)$, respectively. The determinant of this system is the Wronskian, which is different from zero.

18) ta PI A RINTO Ae x = ; : 
(15) Aussagen tiber das Vorzeichen der Wronskideterminante werden in Anh. 4 
gewonnen werden. 


«= 
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Damit hat man sofort 


27 27 
JOF(Y) r.(9) ad fof) r, (9) dd 
0.1.7) a, = = RAI se 
W(1—exp[— 2r:iQ]) 3 W(1— exp[+ 2ri0Q]) © 
One thus sees that, in the linear approximation, the modified equilibrium orbit is uniquely determined as long as the number $Q$ of betatron wavelengths around the circumference differs from an integer.
By inserting eq. (I.1.7) into eq. (I.1.2), a closed expression for the modified equilibrium orbit can now be obtained. The result is

$$
r(\vartheta) = \frac{1}{W}\int_{\vartheta-2\pi}^{\vartheta} d\vartheta'\, \delta f(\vartheta')
\left[\frac{r_1(\vartheta')\, r_2(\vartheta)}{1-\exp[-2\pi i Q]} - \frac{r_2(\vartheta')\, r_1(\vartheta)}{1-\exp[+2\pi i Q]}\right].
\tag{I.1.8}
$$

This equation is obtained most simply by shifting the origin in eqs. (I.1.2) and (I.1.7) such that the point on the equilibrium orbit under consideration lies at $\vartheta = 2\pi$, which implies no loss of generality.
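
As a plausibility check of the closed-orbit formula (I.1.8), the following sketch evaluates it for a deliberately simplified model that is not taken from the paper: a constant field index $n = -Q^2$, so that the orbit equation is $r'' + Q^2 r = \delta f$, the Floquet pair is $r_{1,2} = \exp[\pm iQ\vartheta]$ and $W = r_1 r_2' - r_2 r_1' = -2iQ$. The function name `closed_orbit`, the toy model and the trapezoidal quadrature are assumptions made for illustration only.

```python
import cmath
import math


def closed_orbit(delta_f, Q, theta, n_steps=4000):
    """Evaluate eq. (I.1.8) for the toy Floquet pair r1 = exp(iQt), r2 = exp(-iQt).

    delta_f : callable, guide-field error term, assumed 2*pi periodic
    Q       : number of betatron wavelengths per turn (must not be an integer)
    theta   : azimuth at which the closed orbit is evaluated
    """
    W = -2j * Q  # Wronskian r1*r2' - r2*r1' of the toy pair (assumption of this sketch)
    d_minus = 1 - cmath.exp(-2j * math.pi * Q)
    d_plus = 1 - cmath.exp(+2j * math.pi * Q)

    def r1(t):
        return cmath.exp(1j * Q * t)

    def r2(t):
        return cmath.exp(-1j * Q * t)

    def integrand(tp):
        return delta_f(tp) * (r1(tp) * r2(theta) / d_minus
                              - r2(tp) * r1(theta) / d_plus)

    # Composite trapezoidal rule over one full turn ending at theta.
    h = 2 * math.pi / n_steps
    start = theta - 2 * math.pi
    total = 0.5 * (integrand(start) + integrand(theta))
    for k in range(1, n_steps):
        total += integrand(start + k * h)
    # The result is real up to rounding because r2 is the conjugate of r1.
    return (total * h / W).real
```

For $\delta f = \cos\vartheta$ the periodic solution of the toy equation is $\cos\vartheta/(Q^2-1)$, which the quadrature reproduces; choosing $Q$ close to an integer shows the growth of the orbit distortion that the text describes.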

Every other particle orbit (a different choice of the constants $a_1$, $a_2$) performs free betatron oscillations about this modified equilibrium orbit. Investigations of the influence of perturbations of the magnetic guide field caused by misalignments have been carried out by several authors without using the concept of the modified equilibrium orbit (19). Naturally it is found there, too, that every betatron oscillation remains bounded in amplitude if $Q$ is not an integer. For methodological reasons, however, the author prefers the concept of the modified equilibrium orbit; it will also prove useful in investigating the simultaneous effect of errors in the guide field and in the field index.

All explicit considerations of this paper neglect the influence of the accelerating voltage and of the change of the operating parameters (of the «ideal» field index as well as of all errors) during the acceleration cycle. In reality, however, the problem is not stationary. Thus the position of the modified equilibrium orbit certainly does not remain the same during the whole acceleration cycle. It can be shown, however, that the particle beam follows a sufficiently slow change of the instantaneous equilibrium orbit.

(19) Among others E. REGENSTREIF (unpublished CERN reports); J. S[...] (name illegible in the source), T/R 1314; M. SANDS and B. TOUSCHEK: Nuovo Cimento, 10, 604 (1953).

In the resonant case itself ($Q$ an integer) there exists, according to eq. (I.1.7), in general no closed orbit in the synchrotron, unless the integrals that occur happen to vanish. All orbits are displaced after one revolution by the same $\delta r(\vartheta)$; for this quantity one has, according to eq. (I.1.2),

$$
\delta r(\vartheta) = \frac{1}{W}\int_{\vartheta-2\pi}^{\vartheta} d\vartheta'\, \delta f(\vartheta')\,
\bigl[r_1(\vartheta')\, r_2(\vartheta) - r_2(\vartheta')\, r_1(\vartheta)\bigr].
\tag{I.1.9}
$$

Physically, the monotonic drifting away of all particle orbits is immediately understandable: the errors always act on the particles in the same phase and therefore add up continually instead of compensating each other again in the course of the revolutions. The condition for the absence of this drift, namely the vanishing of the integrals occurring in eq. (I.1.9), means intuitively that the perturbations do no work on any orbit over one revolution.


b) Special cases. – If the error in the guide field is strictly periodic with the original period $\vartheta_0$,

$$
\delta f(\vartheta + \vartheta_0) = \delta f(\vartheta),
\tag{I.1.10}
$$

then no resonance phenomena occur at integer values of $Q$; the particle does not «notice» at all when a revolution has been completed. This case is, however, of mathematical interest only, since such a strict periodicity cannot be realized in practice. According to subsection a the modified equilibrium orbit is uniquely determined and is therefore identical with the orbit that is periodic in $\vartheta_0$. One merely has a slightly modified ideal synchrotron. Mathematically, the claim of freedom from resonance is easily obtained from the general theory. For the coefficient of $r_2(\vartheta)$ in eq. (I.1.8) one has, because of (I.1.10) and (E.10),

$$
\frac{\displaystyle\int_{\vartheta-2\pi}^{\vartheta} \delta f(\vartheta')\, r_1(\vartheta')\, d\vartheta'}{W\,(1-\exp[-2\pi i Q])}
= \frac{\displaystyle\int_{\vartheta-\vartheta_0}^{\vartheta} \delta f(\vartheta')\, r_1(\vartheta')\, d\vartheta'}{W\,(1-\exp[-i\mu])}.
\tag{I.1.11}
$$

The second special case to be treated in this subsection concerns field errors caused solely by misalignments according to eq. (E.17); the misalignment $\delta\xi(\vartheta)$ is assumed to be a function that varies slowly compared with the betatron wavelength. In the mechanical model of the oscillating spring this means that the position of the point of suspension is changed slowly compared with the period of oscillation. There the result is clear: the suspended weight will simply follow the motion of the point of suspension (20). Likewise one will expect that in the strong-focusing synchrotron, too, the equilibrium orbit simply follows such «adiabatic» misalignments. Because of the more complicated equation of motion and the more complicated influence of misalignments this is, however, not entirely self-evident and shall be proved separately.

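By comparing eqs. (E.17) and (E.19) and using eq. (E.6) for the free betatron oscillations, one first obtains

$$
\int_{\vartheta-2\pi}^{\vartheta} \delta f(\vartheta')\, r_1(\vartheta')\, d\vartheta'
= -\int_{\vartheta-2\pi}^{\vartheta} \delta\xi(\vartheta')\, n(\vartheta')\, r_1(\vartheta')\, d\vartheta'
= -\int_{\vartheta-2\pi}^{\vartheta} \delta\xi(\vartheta')\, r_1''(\vartheta')\, d\vartheta'.
\tag{I.1.12}
$$

Integrating by parts twice yields

$$
\int_{\vartheta-2\pi}^{\vartheta} \delta f(\vartheta')\, r_1(\vartheta')\, d\vartheta'
= \bigl(-\delta\xi(\vartheta)\, r_1'(\vartheta) + \delta\xi'(\vartheta)\, r_1(\vartheta)\bigr)\,(1-\exp[-2\pi i Q])
- \int_{\vartheta-2\pi}^{\vartheta} \delta\xi''(\vartheta')\, r_1(\vartheta')\, d\vartheta'.
\tag{I.1.13}
$$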


If $\delta\xi(\vartheta)$ varies slowly compared with $r_1(\vartheta)$, the integral on the right-hand side will be small compared with that on the left, i.e. compared with the remaining terms on the right-hand side. Insertion into eq. (I.1.8) for the modified equilibrium orbit, together with the definition (E.9) of the Wronskian determinant, thus gives

$$
r(\vartheta) = \delta\xi(\vartheta) - \frac{1}{W}\int_{\vartheta-2\pi}^{\vartheta} d\vartheta'\, \delta\xi''(\vartheta')
\left[\frac{r_1(\vartheta')\, r_2(\vartheta)}{1-\exp[-2\pi i Q]} - \frac{r_2(\vartheta')\, r_1(\vartheta)}{1-\exp[+2\pi i Q]}\right],
\tag{I.1.14}
$$

where the terms on the right-hand side other than $\delta\xi(\vartheta)$ represent a correction term. With adiabatic misalignments of this kind the vacuum chamber will in many cases also follow the curve $\delta\xi(\vartheta)$, and the only terms of practical interest are then these correction terms.


c) Application to the «synchronous orbit». – In the preceding subsections only those particles were considered whose energy or momentum is related to the magnetic field of the ideal synchrotron by eq. (E.1) during the whole acceleration cycle. Such particles would circulate permanently on the equilibrium orbit in the ideal synchrotron under ideal operating conditions. In fact, however, particles with a momentum error $\delta p$ also occur in the synchrotron. This momentum error and the error in phase relative to the accelerating alternating voltage perform oscillations about the zero values (*). Particles with a momentum error (assumed for the moment to be constant in time) possess an equilibrium orbit different from the ideal equilibrium orbit. This orbit is called the «synchronous orbit». The length of the synchronous orbit differs from that of the ideal equilibrium orbit. This lengthening or shortening of the path is connected with a change of the revolution time; the mechanism of the momentum and phase oscillations mentioned above rests on this. Of course, the calculation of the synchronous orbit is, strictly speaking, not a stationary problem. As a consequence of the momentum oscillations the synchronous orbit changes slowly (compared with the wavelength of the betatron oscillations); this oscillatory change of the synchronous orbit is called the synchrotron oscillation. The vacuum chamber must provide sufficient room in the horizontal direction for the synchrotron oscillations.

(20) This result, formulated for betatron oscillations, was read off from his simplified calculations and stated by E. D. COURANT (unpublished).

The synchronous orbit shall now be calculated, neglecting the slow variation of $\delta p$. On the original equilibrium orbit, particles with such a momentum error do not find the magnetic field corresponding to their momentum (actual field errors being disregarded here); according to eq. (E.1) it deviates from the field that would keep them on the equilibrium orbit by

$$
\delta H_z(\vartheta) = -H_z(\vartheta)\,\frac{\delta p}{p}.
\tag{I.1.15}
$$

The associated quantity $\delta f(\vartheta)$ (eq. (E.20)) is therefore given for such particles by

$$
\delta f(\vartheta) = \frac{1}{\varrho(\vartheta)}\,\frac{\delta p}{p}.
\tag{I.1.16}
$$

If the radius of curvature $\varrho(\vartheta)$ is strictly periodic with $\vartheta_0$, the equation of the synchronous orbit for a momentum error $\delta p$ reads as follows

$$
r_{\text{synchr}}(\vartheta) = \frac{1}{W}\int_{\vartheta-\vartheta_0}^{\vartheta} \frac{d\vartheta'}{\varrho(\vartheta')}
\left[\frac{r_1(\vartheta')\, r_2(\vartheta)}{1-\exp[-i\mu]} - \frac{r_2(\vartheta')\, r_1(\vartheta)}{1-\exp[+i\mu]}\right]\frac{\delta p}{p}.
\tag{I.1.17}
$$
The relative lengthening of the path follows from simple geometric considerations as

$$
\frac{\delta L}{L} = \frac{1}{\vartheta_0}\int_0^{\vartheta_0} \frac{r_{\text{synchr}}(\vartheta)}{\varrho(\vartheta)}\, d\vartheta.
\tag{I.1.18}
$$

After insertion of (I.1.17) this gives

$$
\frac{\delta L}{L} = \frac{1}{W\vartheta_0}\int_0^{\vartheta_0} \frac{d\vartheta}{\varrho(\vartheta)}
\int_{\vartheta-\vartheta_0}^{\vartheta} \frac{d\vartheta'}{\varrho(\vartheta')}
\left[\frac{r_1(\vartheta')\, r_2(\vartheta)}{1-\exp[-i\mu]} - \frac{r_2(\vartheta')\, r_1(\vartheta)}{1-\exp[+i\mu]}\right]\frac{\delta p}{p},
\tag{I.1.19}
$$

and, after integration by parts, a closed expression (not legible in the source) that defines the quantity $\alpha$ through

$$
\frac{\delta L}{L} = \frac{1}{\alpha}\,\frac{\delta p}{p}.
\tag{I.1.20}
$$

Only the quantity $\alpha$ enters the theory of the momentum and phase oscillations. In particular, the «transition energy» $E_t$ at which the phase oscillations become unstable is, according to CLS, given for $\alpha > 1$ by

$$
E_t = \sqrt{\alpha}\; E_0.
\tag{I.1.21}
$$

Here $E_t$ is meant as total energy (including rest energy) and $E_0$ denotes the rest energy of the particles.

It is clear from the derivation given that the betatron oscillations are simply superposed on the synchronous orbit. If, in addition, the guide field deviates from the ideal value also for particles with $\delta p = 0$, the synchronous orbit is superposed on the modified equilibrium orbit calculated in subsection a.


2. – Errors in the field index.

The occurrence of unstable regions in the neighbourhood of integer or half-integer values of $Q$, the number of betatron wavelengths around the circumference, is proved in general by the considerations at the end of subsection b of the Introduction. By investigating special examples, these instabilities were found independently of the author by Lundquist and Regenstreif (21). In this section a perturbation treatment of the problem shall be carried out, which is naturally suited in particular to small errors of the field index and to small misalignments along the equilibrium orbit that disturb the strict periodicity, as well as to length errors of the individual magnets.

(21) S. LUNDQUIST: Phys. Rev., 91, 981 (1953); E. REGENSTREIF, unpublished CERN reports.


a) Perturbation calculation for errors in the field index. – In contrast to the effect of an error in the guide field, the effect of an error in the field index is described by a homogeneous instead of an inhomogeneous differential equation (eq. (E.22)). A correspondingly simple closed solution can therefore not be given. The rigorous working out of special problems is of course possible; a more general insight is gained, however, by a perturbation calculation.
The task is to find, for Hill's differential equation (E.22), solutions («Floquet solutions») $\tilde r_1(\vartheta)$ and $\tilde r_2(\vartheta)$ with the periodicity properties

$$
\tilde r_1(\vartheta + 2\pi) = \exp[2\pi i \tilde Q]\, \tilde r_1(\vartheta), \qquad
\tilde r_2(\vartheta + 2\pi) = \exp[-2\pi i \tilde Q]\, \tilde r_2(\vartheta);
\tag{I.2.1}
$$

cf. eq. (E.10), and note that the field index is now periodic only with the period $2\pi$. Let us put (cf. eqs. (E.11) and (E.12))

$$
\begin{aligned}
r_1(\vartheta) &= v_1(\vartheta)\exp[iQ\vartheta], & \tilde r_1(\vartheta) &= \tilde v_1(\vartheta)\exp[i\tilde Q\vartheta], \\
r_2(\vartheta) &= v_2(\vartheta)\exp[-iQ\vartheta], & \tilde r_2(\vartheta) &= \tilde v_2(\vartheta)\exp[-i\tilde Q\vartheta],
\end{aligned}
\tag{I.2.2}
$$

where the left-hand set of solutions refers to the unperturbed problem (with $\delta n(\vartheta) = 0$) and the right-hand set to the perturbed problem. For the function $v_1(\vartheta)$ the differential equation

$$
v_1''(\vartheta) + 2iQ\, v_1'(\vartheta) - \bigl(n(\vartheta) + Q^2\bigr)\, v_1(\vartheta) = 0
\tag{I.2.3}
$$

holds, with the periodicity condition

$$
v_1(\vartheta + 2\pi) = v_1(\vartheta);
\tag{I.2.4}
$$

for the solution $\tilde v_1(\vartheta)$ of the perturbed problem a corresponding equation

$$
\tilde v_1''(\vartheta) + 2i\tilde Q\, \tilde v_1'(\vartheta) - \bigl(n(\vartheta) + \delta n(\vartheta) + \tilde Q^2\bigr)\, \tilde v_1(\vartheta) = 0
\tag{I.2.5}
$$

and periodicity condition

$$
\tilde v_1(\vartheta + 2\pi) = \tilde v_1(\vartheta)
\tag{I.2.6}
$$

hold.
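Corresponding equations, with the sign of $Q$ or $\tilde Q$ reversed in each case, hold for $v_2(\vartheta)$ and $\tilde v_2(\vartheta)$.

It suggests itself, in analogy to Schrödinger's perturbation theory in quantum theory, to put

$$
\tilde Q = Q^{(0)} + Q^{(1)} + \dots, \qquad
\tilde v_1(\vartheta) = \tilde v^{(0)}(\vartheta) + \tilde v^{(1)}(\vartheta) + \dots
\tag{I.2.7}
$$

and then to require

$$
\begin{aligned}
\tilde v^{(0)\prime\prime}(\vartheta) + 2iQ^{(0)}\, \tilde v^{(0)\prime}(\vartheta) - \bigl(n(\vartheta) + Q^{(0)2}\bigr)\, \tilde v^{(0)}(\vartheta) &= 0, \\
\tilde v^{(1)\prime\prime}(\vartheta) + 2iQ^{(0)}\, \tilde v^{(1)\prime}(\vartheta) - \bigl(n(\vartheta) + Q^{(0)2}\bigr)\, \tilde v^{(1)}(\vartheta)
&= -2iQ^{(1)}\, \tilde v^{(0)\prime}(\vartheta) + \bigl(2Q^{(0)}Q^{(1)} + \delta n(\vartheta)\bigr)\, \tilde v^{(0)}(\vartheta),
\end{aligned}
\tag{I.2.8}
$$

as well as

$$
\tilde v^{(0)}(\vartheta + 2\pi) = \tilde v^{(0)}(\vartheta), \qquad
\tilde v^{(1)}(\vartheta + 2\pi) = \tilde v^{(1)}(\vartheta).
\tag{I.2.9}
$$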

If $Q$ is neither an integer nor a half-integer, it follows from the first of eqs. (I.2.8) and (I.2.9) that one can put

$$
Q^{(0)} = Q, \qquad \tilde v^{(0)}(\vartheta) = v_1(\vartheta).
\tag{I.2.10}
$$

Admittedly the quantity $Q^{(0)}$ is determined only up to an arbitrary additive integer, and the associated $\tilde v^{(0)}(\vartheta)$ only up to an arbitrary constant factor, but the functions $\tilde r_1(\vartheta)$ calculated in this way then differ only by a numerical factor. For integer or half-integer $Q$ the first of eqs. (I.2.8) and (I.2.9) are also satisfied by $v_2(\vartheta)\exp[-2iQ\vartheta]$; one then has a degenerate problem.

For an integer number of betatron wavelengths around the circumference, the quantity $Q$ occurring in eq. (I.2.2) shall be redefined, in order to simplify the mathematical treatment, such that $Q = 0$. One then simply has

$$
r_1(\vartheta) = v_1(\vartheta), \qquad r_2(\vartheta) = v_2(\vartheta),
\tag{I.2.11}
$$

and the degeneracy of the problem (in the sense of perturbation theory) becomes evident, since $v_1(\vartheta)$ and $v_2(\vartheta)$ obey the same differential equation and the same periodicity condition. For half-integer $Q$, too, $Q = 0$ shall formally be put, and in exchange the periodicity conditions (I.2.4), (I.2.6), (I.2.9) shall be replaced by the corresponding antiperiodicity requirements. From the first of eqs. (I.2.8) one can then conclude $Q^{(0)} = 0$; $\tilde v^{(0)}(\vartheta)$, the zeroth approximation of the perturbed problem, must now, however, be taken as a linear combination of $r_1(\vartheta)$ and $r_2(\vartheta)$,

$$
\tilde v^{(0)}(\vartheta) = a_1^{(0)}\, r_1(\vartheta) + a_2^{(0)}\, r_2(\vartheta).
\tag{I.2.12}
$$

Exactly as in the corresponding degenerate problem of Schrödinger's perturbation theory, the ratio of the coefficients $a_1^{(0)}$ and $a_2^{(0)}$ can be determined only from the equation for the first approximation.

In the second of eqs. (I.2.8) (with $Q^{(0)} = 0$) the right-hand side is known; according to eq. (I.1.2) the solution of this equation can be given in closed form,

$$
\tilde v^{(1)}(\vartheta) = a_1^{(1)}\, r_1(\vartheta) + a_2^{(1)}\, r_2(\vartheta)
+ \frac{1}{W}\int_0^{\vartheta} d\vartheta'\, \bigl\{-2iQ^{(1)}\, \tilde v^{(0)\prime}(\vartheta') + \delta n(\vartheta')\, \tilde v^{(0)}(\vartheta')\bigr\}
\bigl\{r_1(\vartheta')\, r_2(\vartheta) - r_2(\vartheta')\, r_1(\vartheta)\bigr\},
\tag{I.2.13}
$$

where the coefficients $a_1^{(1)}$, $a_2^{(1)}$ then follow only from the second approximation. A corresponding equation (with $r_1(\vartheta)$, $r_2(\vartheta)$ replaced by their first derivatives) holds according to eq. (I.1.3) for the first derivative $\tilde v^{(1)\prime}(\vartheta)$. Now the periodicity (or antiperiodicity) condition (I.2.9) must be invoked. The integrals occurring in the equations for $\tilde v^{(1)}(\vartheta)$ (eq. (I.2.13)) and $\tilde v^{(1)\prime}(\vartheta)$ must vanish when they are extended from $0$ to $2\pi$. In the same way as eq. (I.1.7) was obtained from eq. (I.1.6), it then follows that

$$
\int_0^{2\pi} \bigl(-2iQ^{(1)}\, \tilde v^{(0)\prime}(\vartheta) + \delta n(\vartheta)\, \tilde v^{(0)}(\vartheta)\bigr)\, r_j(\vartheta)\, d\vartheta = 0, \qquad j = 1, 2.
\tag{I.2.14}
$$

This is the system of equations from which, after insertion of eq. (I.2.12), the quantity $Q^{(1)}$ and the ratio of the coefficients $a_1^{(0)}$ and $a_2^{(0)}$ are to be determined.


Let the abbreviation

$$
N_{jk} = \int_0^{2\pi} \delta n(\vartheta)\, r_j(\vartheta)\, r_k(\vartheta)\, d\vartheta
\tag{I.2.15}
$$

($j, k = 1, 2$) now be introduced. Evidently this quantity is symmetric in the indices $j$, $k$. Furthermore, for complex conjugate functions $r_1(\vartheta)$, $r_2(\vartheta)$ (cf. Sect. 1a),

$$
N_{22} = N_{11}^{*}, \qquad N_{12} = N_{21} = N_{12}^{*}.
\tag{I.2.16}
$$

Using the definition (E.9) of the Wronskian determinant and observing the periodicity (or antiperiodicity) of $r_1(\vartheta)$, $r_2(\vartheta)$, one moreover easily shows that

$$
\int_0^{2\pi} r_1(\vartheta)\, r_2'(\vartheta)\, d\vartheta = -\int_0^{2\pi} r_2(\vartheta)\, r_1'(\vartheta)\, d\vartheta = \pi W, \qquad
\int_0^{2\pi} r_1(\vartheta)\, r_1'(\vartheta)\, d\vartheta = \int_0^{2\pi} r_2(\vartheta)\, r_2'(\vartheta)\, d\vartheta = 0.
\tag{I.2.17}
$$


The system of equations for the coefficients $a_1^{(0)}$, $a_2^{(0)}$ that follows from eq. (I.2.14), and that at the same time allows the change $Q^{(1)}$ of the number of betatron wavelengths around the circumference to be calculated in lowest approximation, can thus be written in the form

$$
a_1^{(0)}\bigl(N_{12} + 2\pi i W Q^{(1)}\bigr) + a_2^{(0)} N_{22} = 0, \qquad
a_1^{(0)} N_{11} + a_2^{(0)}\bigl(N_{12} - 2\pi i W Q^{(1)}\bigr) = 0.
\tag{I.2.18}
$$

The determinant of this system must vanish; observing the symmetry of $N_{jk}$, one thus obtains for $Q^{(1)}$ the equation

$$
\bigl(Q^{(1)}\bigr)^2 = -\,\frac{\det N}{(2\pi i W)^2}
\tag{I.2.19}
$$

with

$$
\det N = N_{11} N_{22} - N_{12} N_{21}.
\tag{I.2.20}
$$

In this approximation one has stability for real $Q^{(1)}$ (or $(Q^{(1)})^2 > 0$) and instability for imaginary $Q^{(1)}$ (or $(Q^{(1)})^2 < 0$). Since according to Sect. 1a the Wronskian determinant is an imaginary quantity, one has stability for $\det N < 0$ and instability for $\det N > 0$. A more detailed discussion of the consequences of this equation follows in the next subsection.

Some further applications of the perturbation calculation sketched here are presented in App. 2.
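
The stability criterion of eqs. (I.2.15), (I.2.19) and (I.2.20) can be tried out numerically. The sketch below assumes a toy model that is not part of the paper: a constant unperturbed field index with half-integer $Q_0 = 5/2$, for which $r_1 = \exp[iQ_0\vartheta]$, $r_2 = r_1^{*}$ and $W = -2iQ_0$. The function name and the simple rectangle-rule quadrature are illustrative choices.

```python
import cmath
import math


def first_order_tune_shift_squared(delta_n, r1, W, n_steps=2000):
    """Return ((Q1)^2, N11, N12, N22) following eqs. (I.2.15), (I.2.19), (I.2.20).

    delta_n : callable, field-index error as a function of azimuth
    r1      : callable, Floquet solution r1; r2 is taken as its complex conjugate
    W       : Wronskian r1*r2' - r2*r1' (purely imaginary in a stable region)
    """
    h = 2 * math.pi / n_steps
    N11 = N12 = N22 = 0j
    # Rectangle rule over one turn; adequate for smooth periodic integrands.
    for k in range(n_steps):
        t = k * h
        a = r1(t)
        b = a.conjugate()
        dn = delta_n(t)
        N11 += dn * a * a * h
        N12 += dn * a * b * h
        N22 += dn * b * b * h
    det_N = N11 * N22 - N12 * N12
    q1_squared = -det_N / (2j * math.pi * W) ** 2
    return q1_squared.real, N11, N12, N22
```

In this toy model an error $\delta n = \varepsilon\cos 5\vartheta$, which is in step with $r_1^2$, gives $(Q^{(1)})^2 = -\varepsilon^2/(16 Q_0^2) < 0$, i.e. instability, whereas a constant error only shifts the tune, $(Q^{(1)})^2 = (\delta n/2Q_0)^2 > 0$.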


b) Equations for the unstable region. – The basis for the investigation of the instability caused by errors in the field index is eq. (I.2.20). We first show that no instability occurs if $\delta n(\vartheta)$ is periodic with the original period $\vartheta_0$. Physically this is immediately evident, since such a change of the field index (at least if it is sufficiently small) merely yields a modified ideal accelerator in which the period $2\pi$ does not make itself felt in any way. Mathematically this result follows from a calculation of $N_{11}$. One has namely (cf. the corresponding considerations at the beginning of Sect. 1b)

$$
N_{11} = \int_0^{\vartheta_0} \delta n(\vartheta)\, \bigl(r_1(\vartheta)\bigr)^2\, d\vartheta \cdot \sum_{m=1}^{2\pi/\vartheta_0} \exp[2i(m-1)\mu]
= \int_0^{\vartheta_0} \delta n(\vartheta)\, \bigl(r_1(\vartheta)\bigr)^2\, d\vartheta \cdot \frac{\exp[4\pi i Q] - 1}{\exp[2i\mu] - 1},
\tag{I.2.21}
$$

and this vanishes for integer and half-integer values of $Q$. According to eq. (I.2.20) one then always has

$$
\det N = -N_{12}^2 \le 0,
\tag{I.2.22}
$$

hence stability.

For the investigation of the unstable regions caused by errors in the field index, the unperturbed field index $n(\vartheta)$ shall be split into two parts: a part that leads exactly to an integer or half-integer $Q$, and an additional term $\delta n_{\text{ideal}}(\vartheta)$ that represents the change of the ideal working point when a small neighbourhood of these resonance conditions is swept over. Formally equivalent to this is to understand the quantity $n(\vartheta)$ in eq. (E.22) as the field index that leads to an integer or half-integer value of $Q$, and to split the additional term into two parts

$$
\delta n(\vartheta) = \delta n_{\text{ideal}}(\vartheta) + \overline{\delta n}(\vartheta),
\tag{I.2.23}
$$

where $\delta n_{\text{ideal}}(\vartheta)$ denotes the change of the ideal field index, strictly periodic with $\vartheta_0$, and $\overline{\delta n}(\vartheta)$ the actual error. Strictly speaking, the error $\overline{\delta n}(\vartheta)$ will also depend on the choice of the working point, i.e. on $\delta n_{\text{ideal}}(\vartheta)$; this should, however, be an effect of higher order.

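In this way $N_{jk}$ (eq. (I.2.15)) splits into a sum of two terms, of which the part arising from $\delta n_{\text{ideal}}(\vartheta)$ shall be examined first. It shall be shown that

$$
\int_0^{2\pi} \delta n_{\text{ideal}}(\vartheta)\, \bigl(r_1(\vartheta)\bigr)^2\, d\vartheta = \int_0^{2\pi} \delta n_{\text{ideal}}(\vartheta)\, \bigl(r_2(\vartheta)\bigr)^2\, d\vartheta = 0, \qquad
\int_0^{2\pi} \delta n_{\text{ideal}}(\vartheta)\, r_1(\vartheta)\, r_2(\vartheta)\, d\vartheta = -2\pi i W\, \delta Q^{(1)},
\tag{I.2.24}
$$

where $\delta Q^{(1)}$ denotes the change of the value of $Q$ for the error-free accelerator (in first approximation) when $n(\vartheta)$ is changed by $\delta n_{\text{ideal}}(\vartheta)$. For $\delta n(\vartheta) = \delta n_{\text{ideal}}(\vartheta)$, namely, $\tilde r_1(\vartheta)$ must go over into $r_1(\vartheta)$ in the limit $\delta n(\vartheta) \to 0$, i.e. eq. (I.2.18) must be correct for $a_2^{(0)} = 0$. This indeed leads to the assertion (I.2.24). The first of these two equations could also have been obtained from the periodicity considerations at the beginning of this subsection; for the second equation an independent proof will be given in App. 2a.

If one now introduces the notation

$$
\overline{N}_{jk} = \int_0^{2\pi} \overline{\delta n}(\vartheta)\, r_j(\vartheta)\, r_k(\vartheta)\, d\vartheta,
\tag{I.2.25}
$$

it follows for the quantities $N_{jk}$ defined in subsection a, in easily understood matrix notation,

$$
\begin{pmatrix} N_{11} & N_{12} \\ N_{21} & N_{22} \end{pmatrix}
= -2\pi i W\, \delta Q^{(1)} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}
+ \begin{pmatrix} \overline{N}_{11} & \overline{N}_{12} \\ \overline{N}_{21} & \overline{N}_{22} \end{pmatrix},
\tag{I.2.26}
$$

and for $\det N$ one obtains

$$
\det N = -(2\pi i W)^2 \left\{ \left(\delta Q^{(1)} - \frac{\overline{N}_{12}}{2\pi i W}\right)^2 - \frac{\overline{N}_{11}\overline{N}_{22}}{(2\pi i W)^2} \right\},
\tag{I.2.27}
$$

and hence according to eq. (I.2.19)

$$
\bigl(Q^{(1)}\bigr)^2 = \left(\delta Q^{(1)} - \frac{\overline{N}_{12}}{2\pi i W}\right)^2 - \frac{\overline{N}_{11}\overline{N}_{22}}{(2\pi i W)^2}.
\tag{I.2.28}
$$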

From this equation, in conjunction with eqs. (I.2.18) and (I.2.12), all required information about the unstable regions lying near integer or half-integer values of $Q$ can be obtained, at least in the lowest approximation of perturbation theory. It should be repeated once more that $\delta Q^{(1)}$ denotes the difference between the value of $Q$ in the associated ideal synchrotron and the nearest integer or half-integer, whereas $Q^{(1)}$ is defined by the fact that, according to eq. (I.2.1), in the lowest approximation of perturbation theory

$$
\tilde r_1(\vartheta + 2\pi) = \pm\exp[2\pi i Q^{(1)}]\, \tilde r_1(\vartheta), \qquad
\tilde r_2(\vartheta + 2\pi) = \pm\exp[-2\pi i Q^{(1)}]\, \tilde r_2(\vartheta).
\tag{I.2.29}
$$

The upper sign holds in the neighbourhood of an integer value of $Q$, the lower in the neighbourhood of a half-integer value. One has stability for real and instability for imaginary $Q^{(1)}$. The boundaries of the unstable region lie at $Q^{(1)} = 0$. In Fig. 4, $(Q^{(1)})^2$ is plotted as a function of $\delta Q^{(1)}$. The width of the unstable region is given by

$$
\delta Q^{(1)}_{\text{inst}} = \frac{|\overline{N}_{11}|}{\pi\,|W|};
\tag{I.2.30}
$$

in this approximation one has no unstable region if $\overline{N}_{11}$ vanishes. The distance of the centre of the unstable region from the nearest integer or half-integer value of $Q$ is found to be

$$
\delta Q^{(1)}_{\text{mitt}} = \frac{\overline{N}_{12}}{2\pi i W}.
\tag{I.2.31}
$$

Fig. 4. – $(Q^{(1)})^2$ according to eq. (I.2.28), with some of the quantities occurring in the text marked.

The whole perturbation procedure is of course meaningful only for values of $\delta Q^{(1)}$ that are small compared with unity.
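
The width formula (I.2.30) can be compared with a direct numerical integration over one turn. The sketch below assumes, as a toy model not taken from the paper, Hill's equation in the form $r'' = (n + \delta n)\,r$ with $n = -Q_0^2$ and $\delta n = \varepsilon\cos p\vartheta$; for this model $|\overline{N}_{11}| = \pi\varepsilon$ and $|W| = p$ at $Q_0 = p/2$, so eq. (I.2.30) predicts a width of $\varepsilon/p$ centred on $Q_0 = p/2$. Stability is read off from the half-trace of the one-turn matrix (stable if its modulus is below one), which is the usual Floquet criterion rather than a method of the paper; the function name and the Runge-Kutta integrator are illustrative choices.

```python
import math


def half_trace(Q0, eps, p, n_steps=4000):
    """Half-trace of the one-turn matrix for x'' = -(Q0**2 - eps*cos(p*t)) * x.

    |half_trace| < 1 means stable betatron motion, > 1 means an unstable region.
    """
    def deriv(t, x, v):
        # First-order form of the toy Hill equation: (x', v') = (v, x'').
        return v, -(Q0 * Q0 - eps * math.cos(p * t)) * x

    h = 2 * math.pi / n_steps
    columns = []
    # Propagate the two unit initial conditions with classical RK4.
    for x, v in ((1.0, 0.0), (0.0, 1.0)):
        t = 0.0
        for _ in range(n_steps):
            k1x, k1v = deriv(t, x, v)
            k2x, k2v = deriv(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
            k3x, k3v = deriv(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
            k4x, k4v = deriv(t + h, x + h * k3x, v + h * k3v)
            x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
            v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
            t += h
        columns.append((x, v))
    return 0.5 * (columns[0][0] + columns[1][1])
```

With $p = 5$ and $\varepsilon = 0.2$ the predicted unstable region is roughly $2.48 < Q_0 < 2.52$; evaluating `half_trace` at a few values of `Q0` inside and outside this interval shows the change from unstable to stable motion.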

Within certain limits it is arbitrary what one calls the associated error-free, ideal accelerator for a given faulty accelerator. The results, in particular the statements contained in eqs. (I.2.30), (I.2.31), are however not affected by this. With a redefinition of $\delta n_{\text{ideal}}(\vartheta)$, i.e. with a changed splitting (I.2.23), eq. (I.2.30) is not altered at all; $\delta Q^{(1)}_{\text{mitt}}$, however, changes in just such a way that the objective position of the unstable region remains the same.


c) Flanks of the unstable regions. – If one operates under unstable conditions, i.e. inside one of the unstable regions, there is one special orbit whose amplitude decreases monotonically during the revolutions, and another whose amplitude increases monotonically. The general particle orbit is a linear combination of these two special solutions of the equation of motion, and in the course of the revolutions the solution with increasing amplitude prevails. If one operates in the stable domain but close to the boundary of an unstable region, one will expect a beat-like fluctuation of the amplitude of every betatron oscillation: for some time the errors of the field index will act in such a way that the amplitude increases; then, however, the phase of the oscillation shifts relative to the errors and they act in the opposite direction, reducing the amplitude; subsequently the amplitude will increase again, and so on. The size of these beats can forbid operation too close to the boundaries of an unstable region. One can speak of flanks of the unstable region that extend into the stable region.

These qualitative considerations shall now be made quantitatively precise (22). For this purpose the shape of the orbits near integer or half-integer values of $Q$, determined in principle in the preceding subsections, must be examined more closely.

In the ideal, error-free accelerator every betatron oscillation can be represented in the form

$$
r(\vartheta) = \xi\, r_1(\vartheta) + \xi^{*}\, r_2(\vartheta)
\tag{I.2.32}
$$

($r_1(\vartheta)$ and $r_2(\vartheta)$ are Floquet solutions according to eq. (E.10); let $r_2(\vartheta) = r_1^{*}(\vartheta)$). Here $\xi$ and $\xi^{*}$ are constants that are complex conjugate to each other. If one restricts the variable $\vartheta$ on the right-hand side of the equation (but of course not on the left-hand side!) to, say, the interval $0 \le \vartheta < 2\pi$, then $\xi$ and $\xi^{*}$ are multiplied at each revolution by $\exp[2\pi i Q]$ and $\exp[-2\pi i Q]$ respectively; these quantities thus jump in phase with unchanged modulus. It shall be assumed that the maximum amplitude of a betatron oscillation is proportional to $|\xi|$ independently of the special initial conditions.

In an accelerator with errors in the field index one has, instead of eq. (I.2.32),

$$
r(\vartheta) = \eta\, \tilde r_1(\vartheta) + \eta^{*}\, \tilde r_2(\vartheta)
\tag{I.2.33}
$$

with $\tilde r_1(\vartheta)$ and $\tilde r_2(\vartheta)$ according to eq. (I.2.1). If here, too, one restricts the quantity $\vartheta$ on the right-hand side to the interval $0 \le \vartheta < 2\pi$, the phase of the quantities $\eta$ and $\eta^{*}$ jumps with each revolution by $2\pi\tilde Q$ and $-2\pi\tilde Q$ respectively. According to eq. (I.2.12) one now has approximately (for $0 \le \vartheta < 2\pi$)

$$
\tilde r_1(\vartheta) = a_1\, r_1(\vartheta) + a_2\, r_2(\vartheta), \qquad
\tilde r_2(\vartheta) = a_2^{*}\, r_1(\vartheta) + a_1^{*}\, r_2(\vartheta),
\tag{I.2.34}
$$

where the notation for the coefficients has been simplified somewhat by omitting indices. In a sufficiently short interval of the quantity $\vartheta$ a representation of the orbit in the original form (I.2.32) is therefore still possible, the instantaneous value of the quantity $\xi$ being given by

$$
\xi = \eta\, a_1 + \eta^{*}\, a_2^{*}.
\tag{I.2.35}
$$

(22) Similar calculations were carried out independently by J. S. BELL: A.E.R.E. T/M 86. The author owes a simplifying transformation of the result to correspondence with Mr. BELL.
Since the phase of $\eta$, $\eta^{*}$ (but not the modulus) changes during the revolutions, $|\xi|$ fluctuates in the manner of beats between a maximum value $|\xi|_{\max}$ and a minimum value $|\xi|_{\min}$,

$$
|\xi|_{\max} = |\eta|\,\bigl(|a_1| + |a_2|\bigr), \qquad
|\xi|_{\min} = |\eta|\,\bigl|\,|a_1| - |a_2|\,\bigr|,
\tag{I.2.36}
$$

and for the ratio of the two values, which shall be called the beat factor $F$, one obtains

$$
F = \frac{|\xi|_{\max}}{|\xi|_{\min}} = \frac{1 + |a_2/a_1|}{\bigl|\,1 - |a_2/a_1|\,\bigr|}.
\tag{I.2.37}
$$

According to the assumption stated above, $F$ at the same time gives the ratio between the largest and the smallest amplitude, each understood as the maximum amplitude over a larger number of betatron wavelengths.

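The ratio $|a_2/a_1|$ is to be taken from eq. (I.2.18); one finds

$$
\left|\frac{a_2}{a_1}\right| = \frac{\bigl|N_{12} + 2\pi i W Q^{(1)}\bigr|}{|N_{22}|},
\tag{I.2.38}
$$

where $Q^{(1)}$ is chosen positive. The quantities $N_{12}$ and $N_{22}$ in this equation shall be replaced by other quantities that characterize the unstable region. First, according to eqs. (I.2.26) and (I.2.31),

$$
N_{12} = \overline{N}_{12} - 2\pi i W\, \delta Q^{(1)} = 2\pi i W\,\bigl(\delta Q^{(1)}_{\text{mitt}} - \delta Q^{(1)}\bigr).
\tag{I.2.39}
$$

Furthermore, instead of $\delta Q^{(1)}$ the distance $\delta Q^{(1)}_{\text{gr}}$ from the nearest boundary of an unstable region shall be introduced, for which evidently (23)

$$
\delta Q^{(1)} = \delta Q^{(1)}_{\text{mitt}} + \tfrac{1}{2}\,\delta Q^{(1)}_{\text{inst}} + \delta Q^{(1)}_{\text{gr}}.
\tag{I.2.40}
$$

If one also uses eq. (I.2.30) in the denominator of eq. (I.2.38), one obtains

$$
\left|\frac{a_2}{a_1}\right| = \frac{\delta Q^{(1)}_{\text{inst}} + 2\,\bigl(\delta Q^{(1)}_{\text{gr}} - Q^{(1)}\bigr)}{\delta Q^{(1)}_{\text{inst}}}.
\tag{I.2.41}
$$

For the beat factor it thus follows that

$$
F = \frac{\delta Q^{(1)}_{\text{inst}} + \delta Q^{(1)}_{\text{gr}} - Q^{(1)}}{Q^{(1)} - \delta Q^{(1)}_{\text{gr}}}.
\tag{I.2.42}
$$

(23) This is correct in the first place if one is above and not below an unstable region. The reader will easily convince himself, however, that eq. (I.2.42) remains correct if one is below an unstable region; all quantities occurring are then to be replaced by their absolute values.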
Finally, the quantity $Q^{(1)}$ may be eliminated by means of the relation following from eq. (I.2.28),

$$
\bigl(Q^{(1)}\bigr)^2 = \delta Q^{(1)}_{\text{gr}}\, \delta Q^{(1)}_{\text{inst}} + \bigl(\delta Q^{(1)}_{\text{gr}}\bigr)^2.
\tag{I.2.43}
$$

After some transformations one then obtains

$$
F = \sqrt{1 + \frac{\delta Q^{(1)}_{\text{inst}}}{\delta Q^{(1)}_{\text{gr}}}} \approx \sqrt{\frac{\delta Q^{(1)}_{\text{inst}}}{\delta Q^{(1)}_{\text{gr}}}},
\tag{I.2.44}
$$

where the approximate expression on the right is valid in the immediate neighbourhood of the unstable region. The beat factor $F$ is shown in Fig. 5. It tends to infinity at the boundary of the unstable region, as was to be expected. The neighbourhood of the unstable region in which an appreciable blow-up of the beam takes place turns out to be small compared with the width of the unstable region itself. The flanks therefore play no great role in practice.

Fig. 5. – The beat factor $F$ in the neighbourhood of an unstable region according to eq. (I.2.44).

Of greater practical interest than the («static») beat factor calculated here is the ratio of the maximum amplitude, not to the minimum amplitude, but to the amplitude of the betatron oscillations at the moment of injection. If one assumes that injection takes place at a sufficient distance from the neighbouring unstable regions and that the boundary of such an unstable region is approached adiabatically, this «dynamic» beat factor turns out to be the square root of the static one calculated above. The «adiabatic damping» (cf. App. 1b) is disregarded here; it leads to a reduction of all oscillation amplitudes in the course of the acceleration process.

The number $k$ of revolutions between two beat maxima follows from the condition that

$$
2kQ^{(1)} = 1.
\tag{I.2.45}
$$

In the immediate neighbourhood of an unstable region one therefore has, according to eq. (I.2.43),

$$
k = \frac{1}{2\sqrt{\delta Q^{(1)}_{\text{gr}}\, \delta Q^{(1)}_{\text{inst}}}}.
\tag{I.2.46}
$$
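
The relations for the beat factor and the beat period lend themselves to a small numerical illustration. The sketch below takes the width of the unstable region and the distance from its nearest boundary as inputs (called `dq_inst` and `dq_gr` here; these names and the function names are assumptions of this example) and evaluates the static factor of eq. (I.2.44), the same factor via the amplitude ratio of eq. (I.2.37) with $Q^{(1)}$ from eq. (I.2.43), and the number of turns between beat maxima from eq. (I.2.45). The intermediate ratio is written in the form $|a_2/a_1| = \bigl(\delta Q_{\text{inst}} + 2(\delta Q_{\text{gr}} - Q^{(1)})\bigr)/\delta Q_{\text{inst}}$, which is one consistent reading of the source.

```python
import math


def q1_from_distance(dq_gr, dq_inst):
    """Q^(1) from eq. (I.2.43): (Q1)^2 = dq_gr*dq_inst + dq_gr^2."""
    return math.sqrt(dq_gr * dq_inst + dq_gr ** 2)


def beat_factor(dq_gr, dq_inst):
    """Static beat factor F, exact form of eq. (I.2.44)."""
    return math.sqrt(1.0 + dq_inst / dq_gr)


def beat_factor_from_ratio(dq_gr, dq_inst):
    """Same F obtained from the amplitude ratio |a2/a1| and eq. (I.2.37)."""
    q1 = q1_from_distance(dq_gr, dq_inst)
    ratio = (dq_inst + 2.0 * (dq_gr - q1)) / dq_inst
    return (1.0 + ratio) / abs(1.0 - ratio)


def dynamic_beat_factor(dq_gr, dq_inst):
    """Square root of the static factor, as stated for adiabatic approach."""
    return math.sqrt(beat_factor(dq_gr, dq_inst))


def turns_between_maxima(dq_gr, dq_inst):
    """Number k of revolutions between beat maxima, eq. (I.2.45)."""
    return 1.0 / (2.0 * q1_from_distance(dq_gr, dq_inst))
```

At a distance from the boundary equal to the width of the unstable region the static factor has already dropped to $\sqrt{2}$, which illustrates the statement that the flanks are narrow compared with the unstable region itself.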
3. – Various problems.

a) Simultaneous effect of errors in the guide field and in the field index. – In general the two types of error analysed separately in Sects. 1 and 2 will occur together; one then has both a modified equilibrium orbit and unstable regions. The betatron oscillations about the modified equilibrium orbit are not influenced by the errors in the guide field. For them the considerations of Sect. 2 therefore continue to hold. The shape of the modified equilibrium orbit, on the other hand, is affected by both types of error.

First it should be noted that, in deriving eq. (I.1.8) for the modified equilibrium orbit, use was made of the periodicity condition (I.1.4) for a full revolution, but not of the stronger periodicity condition (E.10). Eq. (I.1.8) therefore also holds for a synchrotron in which, besides errors in the guide field, errors in the field index occur; only, instead of the Floquet solutions $r_1(\vartheta)$, $r_2(\vartheta)$ of the error-free accelerator, the solutions $\tilde r_1(\vartheta)$, $\tilde r_2(\vartheta)$ according to eq. (I.2.1) must then be used; correspondingly the Wronskian determinant $W$ must be replaced by the Wronskian determinant formed from these solutions,

$$
\tilde W = \tilde r_1(\vartheta)\, \tilde r_2'(\vartheta) - \tilde r_2(\vartheta)\, \tilde r_1'(\vartheta).
\tag{I.3.1}
$$

One thus obtains the following equation for the modified equilibrium orbit under the influence of both types of error

$$
r(\vartheta) = \frac{1}{\tilde W}\int_{\vartheta-2\pi}^{\vartheta} d\vartheta'\, \delta f(\vartheta')
\left[\frac{\tilde r_1(\vartheta')\, \tilde r_2(\vartheta)}{1-\exp[-2\pi i \tilde Q]} - \frac{\tilde r_2(\vartheta')\, \tilde r_1(\vartheta)}{1-\exp[+2\pi i \tilde Q]}\right].
\tag{I.3.2}
$$


As one sees, resonance, i.e. a formal divergence of the amplitude of the equilibrium orbit, occurs for integer $\tilde Q$, that is, at the boundaries of an unstable region lying near an integer value of $Q$. At the boundaries of every unstable region $\tilde r_1(\vartheta)$ and $\tilde r_2(\vartheta)$ go over into one and the same periodic or antiperiodic solution (24); since the Wronskian determinant $\tilde W$ then vanishes, one would at first expect resonance of the modified equilibrium orbit at the boundaries of all unstable regions. A closer investigation will show at once, however, that no resonance of the equilibrium orbit occurs for half-integer $\tilde Q$. Mathematically, the modified equilibrium orbit also exists inside the unstable regions; in practice this is of no interest, however, since the betatron oscillations about this orbit are not stable there.

(24) See subsection a of the Introduction, in particular eq. (E.13). That the considerations presented there referred in the first place to the unstable regions for a single period of the field index is of no mathematical significance.

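The behaviour of the modified equilibrium orbit in the neighbourhood of an unstable region with half-integer value of $Q$ shall now be investigated. Near the boundary of such a region one can put approximately

$$
1 - \exp[\pm 2\pi i \tilde Q] = 2.
\tag{I.3.3}
$$

In this case it therefore follows from eq. (I.3.2) that

$$
r(\vartheta) = \frac{1}{2\tilde W}\int_{\vartheta-2\pi}^{\vartheta} d\vartheta'\, \delta f(\vartheta')\,
\bigl[\tilde r_1(\vartheta')\, \tilde r_2(\vartheta) - \tilde r_2(\vartheta')\, \tilde r_1(\vartheta)\bigr].
\tag{I.3.4}
$$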


Now a determinant formed from the solutions $\tilde r_1(\vartheta)$, $\tilde r_2(\vartheta)$ stands in the numerator as well as in the denominator. A quotient of two such determinants does not change its value when one passes to any two other linearly independent solutions, and the fact that two particular solutions converge to one and the same function in the limit ($\tilde Q \to$ nearest half-integer) is of no consequence.
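The different behaviour of the resonance denominator $1 - \exp[\pm 2\pi i\tilde Q]$ at half-integer and near integer values can be checked numerically. The following is a minimal sketch; the function name `resonance_denominator` and the sample values of $Q$ are illustrative choices, not taken from the paper.

```python
import cmath
import math


def resonance_denominator(q: float, sign: int = -1) -> complex:
    """Return 1 - exp(sign * 2*pi*i*q), the denominator discussed above.

    `sign` selects the upper (+1) or lower (-1) sign in the exponent.
    """
    return 1 - cmath.exp(sign * 2j * math.pi * q)


# Half-integer Q: the denominator stays finite and equals 2.
half_integer = resonance_denominator(6.5)

# Q slightly above an integer: the denominator is small, about
# +2*pi*i*dQ for the lower sign in the exponent.
d_q = 1e-3
near_integer = resonance_denominator(6.0 + d_q)

print(abs(half_integer), abs(near_integer))
```

The first value shows why no resonance of the equilibrium orbit arises from this factor at half-integer $Q$; the second shows the small denominator near an integer.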

In Fig. 6 the combined action of the errors is shown qualitatively. First, the unstable regions are drawn which, for small errors of the field index, lie near integer or half-integer values of $Q$. The beat factor $F$ (subsection 2c, in particular Eq. (I.2.44)) is also plotted; it tends to infinity at the boundaries of all these unstable regions. Plotted upward is the amplitude of the changed equilibrium orbit, which tends to infinity only at the boundaries of the unstable regions lying at integer $Q$ and shows no unusual behaviour at the boundaries of the other regions. The betatron oscillations with their beats take place about the changed equilibrium orbit. The distance that must be kept from an unstable region at integer $Q$ will mostly be determined by the maximum permissible displacement of the changed equilibrium orbit; for the unstable regions with half-integer $Q$, the distance to be kept is given by the beat effect and the associated blow-up of the beam. It must further be noted that particles with a momentum error experience a slightly changed field index; this was pointed out by Twiss (see App. 1a).

Fig. 6. - Schematic representation of the effect of errors when $Q$ changes as a result of a change in the ideal field index: unstable regions (the two outer ones for integer $Q$), amplitude of the changed equilibrium orbit, beat factor.

In the qualitative survey so far, no use was made of approximate expressions for $\tilde r_1(\vartheta)$, $\tilde r_2(\vartheta)$, $\tilde Q$. Now, in investigating the resonance behaviour, the results of the perturbation calculation of the preceding section will be used; the following results are therefore correct only for sufficiently small errors of the field index. If the expressions (I.2.34) are inserted for $\tilde r_1(\vartheta)$, $\tilde r_2(\vartheta)$ in Eq. (I.3.1), it follows first that

(I.3.5)    $\tilde W = (|a_1|^2 - |a_2|^2)\,W .$

Furthermore, in the denominator of Eq. (I.3.2) one may set

(I.3.6)    $1 - \exp[\pm 2\pi i\tilde Q] \approx \mp 2\pi i\,\delta\tilde Q .$

In the numerator, on the other hand, one may usually set


(eon) FO) AD)  Fo(O')K (9) w FE (OF 


per sll 


d), 


where $\tilde r_{\text{per}}(\vartheta)$ denotes the periodic solution ($\tilde r_i(\vartheta)$ immediately on the region boundary). Only if the integral occurring here is extremely small or vanishes is a more careful treatment required; on this point see the end of this subsection. With (I.3.5), (I.3.6) and (I.3.7) one obtains approximately, for the changed equilibrium orbit near an integer $\tilde Q$,


(11(9) + (a2/a1)r2(P)) JOFO (at ]at)r (0) + 7,(8')) dd 


di => — 
(1.3.8) iO) = Walaa oe 


Dividing out $|a_1|^2$ is permissible although this quantity is not exactly identical in numerator and denominator. In $\tilde W$ it refers to a particular working point near the boundary of the unstable region, in $\tilde r_{\text{per}}(\vartheta)$ on the other hand to this boundary itself; the differences are, however, of higher order.

From Eq. (I.2.41) it follows that the modulus of $a_2/a_1$ is equal to one at the boundary of the unstable region. In the numerator of Eq. (I.3.8) one may therefore set

(I.3.9)    $a_2/a_1 = \varepsilon ,$

where $\varepsilon$ is a complex number of modulus one. For the denominator, Eqs. (I.2.41) and (I.2.43) are now used. Thus one finally obtains for
QRL OO. 


inst 


(71(8) + ers(9 (0) 340 )(e*r (8) + 72(8") dd" 


1.3.10 9)= 
(SE so “ai WOO, 


where the denominator thus varies linearly with $\delta Q_0$. The resulting equation is very similar to the limiting case of Eq. (I.1.8) (equilibrium orbit in a particle accelerator without errors of the field index) near an integer value of $Q$:


SILA O) + (02 (0)) 
ica BY) Ss aniW 50 


The modulus of the factor of $r_1(\vartheta)$ is a measure of the amplitude of the changed equilibrium orbit. Without errors in the field index this factor is given by $(2\pi i\,W\,\delta Q)^{-1}\int \delta f(\vartheta')\,r_2(\vartheta')\,d\vartheta'$, in the presence of errors on the other hand by $(2\pi i\,W\,\delta Q_0)^{-1}\int \delta f(\vartheta')\,\bigl(\varepsilon^* r_1(\vartheta') + r_2(\vartheta')\bigr)\,d\vartheta'$, where $\int \delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'$ and $\int \delta f(\vartheta')\,r_2(\vartheta')\,d\vartheta'$ are complex conjugates of each other. Depending on the phase of $\varepsilon$, one has in the second case a larger or smaller amplitude at a given point than in the first. As to order of magnitude, it thus turns out that the width of the dangerous region in the neighbourhood of a resonance is composed approximately additively of the width of the unstable region and the width of the region of excessive displacement of the equilibrium orbit, as it would be calculated from the errors of the guide field alone.

If, for the Floquet solutions $r_1(\vartheta)$, $r_2(\vartheta)$ in the ideal accelerator at a resonance ($Q$ integer),

(I.3.12)    $\int_0^{2\pi} \delta f(\vartheta')\,r_{1,2}(\vartheta')\,d\vartheta' = 0$

holds, for instance because $\delta f(\vartheta)$ is strictly periodic with period $\vartheta_0$, then the simplified Eq. (I.3.10) is not sufficient. Under the integral one must then insert the first perturbation-theoretic approximation for $\tilde r_{\text{per}}(\vartheta)$, i.e. according to Eq. (I.2.13) replace


d 


(13:13) foro Vo ) ad = Java ) faa On(O Foor(O’) {rx (9°) ro (9) ro (8) (9)}, 


ve 


where $\tilde r_{\text{per}}(\vartheta)$ on the right-hand side is again to be represented by the zeroth-approximation expression. This consideration applies in particular to the behaviour of the synchronous orbit under the influence of errors in the field index.


b) Similarity laws. - To understand general functional relationships, in particular those that will appear in Part II, it is expedient to introduce the concept of "similar arrangements". Since a periodicity region of the field index in a strong-focusing synchrotron is characterized (at least if one restricts oneself to a single direction of oscillation) by the behaviour of the function $n(\vartheta)$ in the interval 0 to $\vartheta_0$, two arrangements shall be called similar if their field indices are obtained from one another by multiplication by a constant factor and a similarity transformation of the $\vartheta$ axis. In formulas: let $n^{(1)}(\vartheta)$, $n^{(2)}(\vartheta)$ be the field indices of the two arrangements; the arrangements shall be called similar if there are two constants $\lambda$ and $\kappa$ such that

(I.3.14)    $n^{(1)}(\vartheta) = \lambda\, n^{(2)}(\kappa\vartheta)$

for $0 \le \vartheta \le \vartheta_0^{(1)}$. In this sense, for example, all arrangements are similar to one another in which every periodicity region consists of two sectors of equal length with field indices of equal magnitude but opposite sign (Eq. (E.15) held for such arrangements). It should be emphasized that all our similarity considerations refer to error-free arrangements and not to errors.

The expression "two similar arrangements lead to the same form of oscillation" shall mean that for every solution $r^{(1)}(\vartheta)$ of the one arrangement another solution $r^{(2)}(\vartheta)$ of the second arrangement can be found such that

(I.3.15)    $r^{(1)}(\vartheta) = r^{(2)}(\kappa\vartheta) .$

This condition means that the betatron oscillations proceed in the same way in both synchrotrons. From the differential equation (E.6) for the free betatron oscillations it follows that two similar arrangements lead to the same form of oscillation if (and, incidentally, only if)

(I.3.16)    $\lambda = \kappa^2 .$

Because of the relation valid for the periodicity angle $\vartheta_0$,

(I.3.17)    $\vartheta_0^{(1)} = \vartheta_0^{(2)}/\kappa ,$

condition (I.3.16) can also be formulated by saying that the quantity $n^{1/2}\vartheta_0$ has the same value for all similar arrangements leading to the same form of oscillation. Here and later the symbol $n$ shall denote the field index at a definite position, for instance at $\vartheta = \alpha\vartheta_0$ ($\alpha$ is a constant). According to Eq. (E.8) the periodicity angle $\vartheta_0$ can be expressed by the number of periods $M$. It then holds that similar arrangements lead to the same form of oscillation if the quantity $n^{1/2}/M$ is the same for both arrangements. In particular, the quantity $\mu$ (Eq. (E.11)) can therefore depend only on $n^{1/2}/M$ for similar arrangements:

(I.3.18)    $\mu = F\!\left(\frac{n^{1/2}}{M}\right) .$

Eq. (E.15) represents a special example.
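The statement that $\mu$ depends only on $n^{1/2}/M$ can be illustrated numerically for the simplest case of two equal sectors with field index $+n$ and $-n$. The sketch below assumes a Hill-type equation $r'' + k(\vartheta)\,r = 0$ with $k = \pm n$ piecewise constant and a periodicity angle $\vartheta_0 = 2\pi/M$; the function name `cos_mu` and the numerical values are hypothetical, and the closed form $\cos\mu = \cos\varphi\,\cosh\varphi$ with $\varphi = \pi n^{1/2}/M$ is the textbook result for this idealized lattice rather than a quotation from the paper.

```python
import math


def cos_mu(n: float, periods: int) -> float:
    """Half-trace of the one-period transfer matrix for two equal sectors
    with field index +n (focusing) and -n (defocusing)."""
    half = math.pi / periods            # angular length of one sector, theta_0 / 2
    k = math.sqrt(n)
    c, s = math.cos(k * half), math.sin(k * half)
    ch, sh = math.cosh(k * half), math.sinh(k * half)
    focusing = [[c, s / k], [-k * s, c]]
    defocusing = [[ch, sh / k], [k * sh, ch]]
    # One-period matrix: defocusing sector applied after the focusing sector.
    m = [[sum(defocusing[i][l] * focusing[l][j] for l in range(2))
          for j in range(2)] for i in range(2)]
    return 0.5 * (m[0][0] + m[1][1])


# Two similar arrangements with the same n**0.5 / M = 0.4.
a = cos_mu(64.0, 20)
b = cos_mu(256.0, 40)
print(a, b)
```

Both arrangements give the same value of $\cos\mu$, as expected for similar arrangements with equal $n^{1/2}/M$.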
Differentiation of Eq. (I.3.18) gives

(I.3.19)    $n\,\frac{d\mu}{dn} = \frac{1}{2}\,\frac{n^{1/2}}{M}\,F'\!\left(\frac{n^{1/2}}{M}\right) .$

Since the right-hand side again depends only on $n^{1/2}/M$, the quotient of the change of $\mu$ and the relative change of the field index is always the same for similar arrangements with the same form of oscillation. For the change in the number of betatron wavelengths around the circumference it follows from this, using Eq. (E.12), that

(I.3.20)    $n\,\frac{dQ}{dn} = \frac{n^{1/2}}{4\pi}\,F'\!\left(\frac{n^{1/2}}{M}\right) .$

The spacing of the resonances, measured in relative changes of the field index, is therefore inversely proportional to $n^{1/2}$; their density is proportional to $n^{1/2}$.
If the correspondence (I.3.15) is adopted for the solutions $r^{(1)}(\vartheta)$, $r^{(2)}(\vartheta)$, then evidently for the derivatives

(I.3.21)    $r^{(1)\prime}(\vartheta) = \kappa\, r^{(2)\prime}(\kappa\vartheta)$

and correspondingly for the Wronskian formed from two solutions

(I.3.22)    $W^{(1)} = \kappa\, W^{(2)} .$

Of more interest are constructions that are independent of the normalization of the solutions. Let $r_1^{(1)}(\vartheta)$, $r_2^{(1)}(\vartheta)$ and $r_1^{(2)}(\vartheta)$, $r_2^{(2)}(\vartheta)$ be two corresponding pairs of solutions and let $W^{(1)}$, $W^{(2)}$ be the Wronskians formed from these solutions; then evidently

(I.3.23)    $\frac{(n^{(1)})^{1/2}}{W^{(1)}}\, r_1^{(1)}(\alpha\vartheta_0^{(1)})\, r_2^{(1)}(\beta\vartheta_0^{(1)}) = \frac{(n^{(2)})^{1/2}}{W^{(2)}}\, r_1^{(2)}(\alpha\vartheta_0^{(2)})\, r_2^{(2)}(\beta\vartheta_0^{(2)})$

($0 \le \alpha, \beta \le 1$). The quantity $(n^{1/2}/W)\,r_1(\alpha\vartheta_0)\,r_2(\beta\vartheta_0)$ is thus invariant for similar arrangements that lead to the same form of oscillation; for similar arrangements in general it therefore depends only on $n^{1/2}/M$. For $r_1(\vartheta)$ and $r_2(\vartheta)$ one can here insert in particular the Floquet solutions.

From the transformation of the angles $\vartheta$, which was given in Eq. (I.3.17) specifically for the periodicity angle $\vartheta_0$, it follows (in symbolic notation) for integrals that

(I.3.24)    $\int_0^{\vartheta_0^{(1)}} d\vartheta \ldots = \frac{1}{\kappa}\int_0^{\vartheta_0^{(2)}} d\vartheta \ldots$

The relation remains correct if on both sides the integration extends over the same fraction of the periodicity region. Under the integral on the left-hand side stand quantities of arrangement 1, under that on the right-hand side the corresponding quantities of arrangement 2. For the expressions under the integral, relations of the type (I.3.23) are to be used. Symbolically it follows from Eq. (I.3.24) that $n^{1/2}\int_0^{\vartheta_0} d\vartheta \ldots$ depends, for similar arrangements, only on the quantity $n^{1/2}/M$ (or $\mu$) that characterizes the form of oscillation.

A number of applications of these considerations will appear in Part II. Here only one application, to the synchronous orbit (cf. subsection 1c), will be made. On the right-hand side of Eq. (I.1.17) there stand integrals of exactly the kind just considered. If $\varrho(\vartheta)/R$ depends only on the relative position between 0 and $\vartheta_0$, i.e. for $\vartheta = \alpha\vartheta_0$ only on $\alpha$, it follows that $r_{\text{synch}}(\vartheta)/R$ is inversely proportional to $n$ for similar arrangements with the same form of oscillation at the same $\delta p/p$. Likewise it follows from Eq. (I.1.19) that the quotient of the relative path lengthening and the relative momentum error is inversely proportional to $n$. As a special case compare, for instance, the relations given in CLS, Eqs. (8) and (9).


4. - Summary of Part I.

The mathematical analysis has confirmed the qualitative considerations of subsection b of the Introduction. Two types of deviation of a strong-focusing particle accelerator from the ideal design were considered, namely deviations in the magnetic guide field on the equilibrium orbit (Sect. 1) and in the field index (Sect. 2). The third type of linear error, namely a twisting of the magnets relative to one another, was not treated here.

An ideal strong-focusing synchrotron consists of a number of strictly identical periodicity regions of the field index with an equilibrium orbit that closes on itself. For such an ideal accelerator the theory of strong focusing holds in its original formulation (subsection a of the Introduction). If only errors in the guide field occur, but none in the field index, there is a changed equilibrium orbit in the accelerator whose deviation from the original equilibrium orbit becomes larger the more closely the number of betatron wavelengths around the circumference, for the associated ideal accelerator, approaches an integer. The equation of this changed equilibrium orbit is given by Eq. (I.1.8), with $\delta f(\vartheta)$ according to subsection b of the Introduction. Errors in the field index lead to unstable regions near those working points that, in the ideal accelerator, give an integer or half-integer number of betatron wavelengths around the circumference. The quantities characterizing the regions (their position and width) are given in first perturbation-theoretic approximation in Eqs. (I.2.30) and (I.2.31). Near such unstable regions the betatron oscillations show beats, expressed by the beat factor $F$ in Eq. (I.2.44). The simultaneous presence of errors in the guide field and in the field index leads to the amplitude of the changed equilibrium orbit becoming infinite at the boundaries of those unstable regions at which the number of betatron wavelengths around the circumference becomes an integer (Sect. 3a). Unstable regions, beat factor and changed equilibrium orbit are shown qualitatively in Fig. 6.


II. - Statistical evaluation.

The equations derived in Part I allow one to calculate the effect of known errors in the guide field and in the field index on the betatron oscillations. The question that arises in the construction of a strong-focusing synchrotron is, however, a different one: what matters is the size of the effects (displacement of the changed equilibrium orbit, width of the unstable regions, etc.) to be expected in an accelerator that has not yet been built. A statistical problem thus results: from well-founded assumptions about the size distribution of the individual contributions to the errors and about their mutual statistical dependence, statistical statements about the size of the resulting perturbations are to be obtained.

Part II is devoted to the problems arising here. In Sect. 1 mean values (or expectation values) are calculated for the quantities of interest. A real particle accelerator, however, is neither identical with the ideal design (that is precisely the effect of the perturbations considered) nor does it possess exactly "mean" errors. The actual effects of the errors can therefore be either larger or smaller than the mean values to be calculated in Sect. 1. For this reason Sect. 2 investigates the probability distributions.


1. - Mean values.

a) Changed equilibrium orbit, preliminary remarks. - The statistical character of the problem will first be illustrated more precisely by the example of errors in the magnetic guide field (25). If one tries to realize the same design of a synchrotron in a large number of copies, the errors of the individual realizations, e.g. alignment errors, will in general be different. In this way one can imagine an ensemble of particle accelerators, all of which are realizations of the same ideal design. The accelerator that will be built is one member of this ensemble; one does not know, however, which one. A certain measure of the perturbations to be expected is obtained by picking out a "mean" accelerator from this ensemble. More precise information is obtained by setting up the probability distribution for the size of the effects of the errors.

Before constructing the ensemble over which the averaging is then to be carried out, the statistical properties of the individual errors must be investigated. Consider, for instance, the radial alignment errors. The alignment can be carried out, for example, by fixing the distance and direction of each individual magnet with respect to the centre of the accelerator, the survey naturally being possible only with finite accuracy and a definite error distribution. In the ensemble of accelerators the alignment errors of the individual magnets are then, at least as far as this particular error is concerned, statistically independent of one another, and the relative frequency of the individual errors is given by the error distribution of the survey process. If the individual magnets are adjusted not relative to the centre of the accelerator but partly relative to their neighbours or to individual auxiliary points, the alignment errors of the individual magnets are no longer statistically independent of one another within the ensemble. This naturally affects the calculation of mean values and probability distributions.

(25) To our knowledge, Lawson (cf. footnote (11)) was the first to make a statistical statement about the influence of errors on the operation of the strong-focusing synchrotron.

For given errors in the magnetic guide field, expressed by the function $\delta f(\vartheta)$ (cf. subsection d of the Introduction), the deviation $\hat r(\vartheta)$ of the changed equilibrium orbit from the ideal equilibrium orbit can be calculated by means of Eq. (I.1.8). Since the particle beam must remain inside the vacuum chamber during the whole acceleration cycle, it is the maximum displacement of this changed equilibrium orbit that is of interest. This displacement must always remain smaller than the radius of the vacuum tube, since the free betatron oscillations take place about the equilibrium orbit. For the horizontal direction of oscillation it must be noted in addition that the synchrotron oscillations (cf. Part I, Sect. 1c) lead to an additional horizontal displacement.

For the statistical investigation of the displacement of the changed equilibrium orbit the following procedure would really be desirable: for each member of the ensemble, constructed according to the statistical properties of the individual errors, one calculates the maximum displacement of the equilibrium orbit from Eq. (I.1.8). From this one then determines the mean value of this maximum displacement (averaged over the ensemble of accelerators), the probability distribution of this quantity, and so on. Carrying out this programme would be rather difficult, essentially because the position and value of the maximum cannot be read off from Eq. (I.1.8) in a simple way. A different procedure is therefore advisable, which is less accurate in the individual case but yields roughly the same information about the effects of the errors to be expected.

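First of all, according to Eq. (I.1.8) the following inequality certainly holds:

(II.1.1)    $|\hat r|_{\max} \le \frac{|r_1|_{\max}\,\left|\int_{\vartheta-2\pi}^{\vartheta} \delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|_{\max}}{|W|\,|\sin \pi Q|} ,$

where use was made of

(II.1.2)    $|1 - \exp[-2\pi i Q]|^2 = 2(1 - \cos 2\pi Q) = 4\sin^2 \pi Q .$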


In Eq. (II.1.1), $|r_1|_{\max}$ denotes the maximum modulus of the function $r_1(\vartheta)$, which is attained (at least) once in every periodicity region of the field index. In arrangements in which every period consists of two sectors of constant field index, this maximum is attained, for reasons of symmetry, in the middle of the sector with negative field index (for the horizontal direction of oscillation). $\left|\int \delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|_{\max}$ denotes the maximum of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$, which is attained for a certain $\vartheta$ that differs from accelerator to accelerator. By Eq. (II.1.1) the investigation of the maximum of $|\hat r|$ is reduced to that of the maximum of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$. The maximum amplitude may thereby be overestimated in each individual case, but never underestimated.
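The step from Eq. (I.1.8) to the bound (II.1.1) can be sketched as follows. The sketch assumes that Eq. (I.1.8) has the two-term structure written in the comment below (it is not restated in this section), that the two Floquet solutions are complex conjugates of each other, $r_2 = r_1^*$, and that $\delta f$ is real, so that the two integrals have equal modulus. All integrals run from $\vartheta - 2\pi$ to $\vartheta$.

```latex
% Assumed form of Eq. (I.1.8), error-free quantities r_1, r_2, W, Q:
%   \hat r(\vartheta) = W^{-1} \int \delta f(\vartheta')
%     \{ r_1(\vartheta) r_2(\vartheta') / (1 - e^{-2\pi i Q})
%      - r_2(\vartheta) r_1(\vartheta') / (1 - e^{+2\pi i Q}) \} d\vartheta'
\begin{align*}
|\hat r(\vartheta)|
 &\le \frac{1}{|W|}\left(
   \frac{|r_1(\vartheta)|\,\bigl|\int \delta f\, r_2\, d\vartheta'\bigr|}{|1-e^{-2\pi i Q}|}
 + \frac{|r_2(\vartheta)|\,\bigl|\int \delta f\, r_1\, d\vartheta'\bigr|}{|1-e^{+2\pi i Q}|}\right) \\
 &= \frac{2\,|r_1(\vartheta)|\,\bigl|\int \delta f\, r_1\, d\vartheta'\bigr|}{2\,|W|\,|\sin \pi Q|}
 \;\le\; \frac{|r_1|_{\max}\,\bigl|\int \delta f\, r_1\, d\vartheta'\bigr|_{\max}}{|W|\,|\sin \pi Q|} .
\end{align*}
```

The first line is the triangle inequality; the second uses $|1 - e^{\mp 2\pi i Q}| = 2|\sin\pi Q|$ from Eq. (II.1.2). Because the two terms are added in modulus regardless of their relative phase, the bound can overestimate but never underestimate the amplitude.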

The investigation of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$ will be carried out in two parts. First (subsection b) this quantity is calculated at an arbitrarily chosen point of the orbit, and a suitable average over the ensemble of accelerators is then taken. These comparatively simple considerations already yield a statement about the displacement of the changed equilibrium orbit that is the more reliable the closer one works to resonance conditions ($Q$ integer). The integral $\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'$ is then practically independent of the angle $\vartheta$, since the function $r_1(\vartheta)$ is nearly periodic with period $2\pi$. Near critical conditions the changed equilibrium orbit is essentially a betatron oscillation that closes on itself. The variation of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$ along the circumference plays a greater role if one is rather far from resonance conditions. As a result of these variations the maximum of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$ can then be noticeably larger than the value at an arbitrarily chosen point. An investigation of the variations of this quantity along the circumference of the accelerator is given in subsection c.


b) Changed equilibrium orbit, neglecting the variation of amplitude along the circumference. - According to the considerations of subsection a, the quantity $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$ is to be calculated for a fixed point on the circumference, say $\vartheta = 2\pi$. For this purpose the abbreviation (26)

(II.1.3)    $F = \int_0^{2\pi} \delta f(\vartheta)\,r_1(\vartheta)\,d\vartheta$

is introduced. Furthermore, a mean value $\langle \hat r_{\max}\rangle$ is defined by (27, 28)

(II.1.4)    $\langle \hat r_{\max}\rangle = \frac{|r_1|_{\max}\,\langle |F|^2\rangle^{1/2}}{|W|\,|\sin \pi Q|} .$

Eq. (II.1.4) would give a better estimate of the displacement of the changed equilibrium orbit if, instead of $F$ at the arbitrary position $\vartheta = 2\pi$, the respective maximum value were inserted. Use is now made of the periodicity properties (E.10) of the function $r_1(\vartheta)$, and one sets

(II.1.5)    $F = \sum_{m=1}^{M} \varphi_m \exp[i(m-1)\mu] ,$

where

(II.1.6)    $\varphi_m = \int_{(m-1)\vartheta_0}^{m\vartheta_0} \delta f(\vartheta)\,\bigl(r_1(\vartheta)\exp[-i(m-1)\mu]\bigr)\,d\vartheta$

denotes an integral over the $m$-th periodicity region of the field index. The function $r_1(\vartheta)\exp[-i(m-1)\mu]$ under the integral sign in Eq. (II.1.6) is chosen so that it is independent of the number ($m$) of the periodicity region concerned.

(26) Confusion of this quantity $F$ with the beat factor in Part I, Sect. 2c is hardly to be feared.
It is now assumed that positive and negative errors $\delta f(\vartheta)$ are equally probable, so that

(II.1.7)    $\langle\varphi_m\rangle = \langle\varphi_m^*\rangle = 0 .$


In addition, these errors are to occur in the same way in all periodicity regions, and errors in different periodicity regions are to be statistically independent of one another; one then has

(II.1.8)    $\langle\varphi_m\,\varphi_k^*\rangle = \delta_{mk}\,\langle|\varphi|^2\rangle$

($\delta_{mk}$ is the Kronecker symbol). From Eqs. (II.1.5), (II.1.7), (II.1.8) one obtains

(II.1.9)    $\langle|F|^2\rangle = M\,\langle|\varphi|^2\rangle .$

With the definition (II.1.4) one thus finally obtains

(II.1.10)    $\langle\hat r_{\max}\rangle = \frac{\sqrt{M}\,|r_1|_{\max}\,|\varphi|_{\text{WmQ}}}{|W|\,|\sin\pi Q|} .$

Here the abbreviation

(II.1.11)    $|\varphi|_{\text{WmQ}} = \sqrt{\langle|\varphi|^2\rangle}$

(WmQ = Wurzel aus dem mittleren Quadrat, i.e. root mean square) was used. In Eq. (II.1.10) one recognizes the resonance denominator, which vanishes at every integer value of $Q$. The occurrence of $\sqrt{M}$ is characteristic of the assumption of statistical independence of the errors in the individual periodicity regions.

(27) An improvement of this definition (i.e. a better adaptation to the actual problem) is achieved according to subsection c by replacing $1/|\sin\pi Q|$ by $1/|\sin\pi Q| + \varepsilon$, where $\varepsilon$ is a number of order of magnitude one.

(28) Mean values over the imagined ensemble of accelerators shall, as is customary in the English-language literature, be denoted by angle brackets. The likewise customary symbol of a bar over the quantity concerned shall be reserved for averaging along the circumference of a single accelerator (cf. Eq. (II.1.23)).
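The relation $\langle|F|^2\rangle = M\langle|\varphi|^2\rangle$ of Eq. (II.1.9), and with it the factor $\sqrt{M}$, can be illustrated by a small Monte Carlo experiment over an imagined ensemble of accelerators. The following is a minimal sketch: the Gaussian distribution of the $\varphi_m$, the function name `mean_square_F` and all numerical values are assumptions made for illustration only; the paper requires only zero mean, identical distribution and independence of the contributions.

```python
import cmath
import random


def mean_square_F(periods: int, mu: float, sigma: float,
                  samples: int, seed: int = 0) -> float:
    """Estimate <|F|^2> for F = sum_m phi_m * exp(i*(m-1)*mu).

    Each phi_m is drawn independently with zero mean; real and imaginary
    parts are Gaussian with standard deviation sigma (hypothetical choice),
    so that <|phi|^2> = 2 * sigma**2.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        f_sum = 0j
        for m in range(periods):          # m here plays the role of (m - 1)
            phi = complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
            f_sum += phi * cmath.exp(1j * m * mu)
        total += abs(f_sum) ** 2
    return total / samples


periods, sigma = 24, 1.0
estimate = mean_square_F(periods=periods, mu=0.9, sigma=sigma, samples=4000)
expected = periods * 2 * sigma ** 2       # M * <|phi|^2>
print(estimate, expected)
```

The phase factors $\exp[i(m-1)\mu]$ drop out of the ensemble average because the cross terms vanish for independent contributions; only the number of periods remains.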

The more detailed evaluation of Eq. (II.1.10) will first be carried out for alignment errors; the assumption of statistical independence of the errors in the individual periodicity regions is fulfilled if these regions are adjusted independently of one another relative to the centre of the accelerator. Beyond this, let it now be assumed, say, that every periodicity region consists of two sectors that are surveyed individually relative to the centre of the accelerator. These may be, for example, the two pieces of constant field index in an arrangement leading to Eq. (E.15); no use will be made in what follows, however, of the assumption of a constant field index. The quantity $\varphi_m$ (Eq. (II.1.6)) splits into two parts,

(II.1.12)    $\varphi_m = \varphi_m^{(1)} + \varphi_m^{(2)} , \qquad \varphi_m^{(1)} = \int_{\text{1st sector of } m\text{-th period}} \delta f(\vartheta)\,r_1(\vartheta)\,d\vartheta , \qquad \varphi_m^{(2)} = \int_{\text{2nd sector of } m\text{-th period}} \delta f(\vartheta)\,r_1(\vartheta)\,d\vartheta ,$

and one has

(II.1.13)    $\langle|\varphi|^2\rangle = \langle|\varphi^{(1)}|^2\rangle + \langle|\varphi^{(2)}|^2\rangle .$


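The horizontal alignment error of each individual sector consists of an error $\delta\xi$ in the position of the midpoint and an angular error $\delta\eta$; for the quantity $\delta\xi(\vartheta)$ in Eq. (E.16) one therefore has

(II.1.14)    $\delta\xi(\vartheta) = \delta\xi + \delta\eta\cdot R\,(\vartheta - \vartheta_m) ,$

where this equation is to hold within each individual sector and $\vartheta_m$ denotes the coordinate $\vartheta$ belonging to the midpoint of the sector. If the errors $\delta\xi$ and $\delta\eta$ are taken to be independent of each other, but their distributions to be the same for both sectors, one finally obtains (29)

(II.1.15)    $\langle\hat r_{\max}\rangle = \frac{\sqrt{M}\,|r_1|_{\max}}{|\sin\pi Q|\,|W|}\Bigl\{\langle\delta\xi^2\rangle\Bigl(\Bigl|\int_{\text{1st sector}} n(\vartheta)\,r_1(\vartheta)\,d\vartheta\Bigr|^2 + \Bigl|\int_{\text{2nd sector}} n(\vartheta)\,r_1(\vartheta)\,d\vartheta\Bigr|^2\Bigr) + R^2\langle\delta\eta^2\rangle\Bigl(\Bigl|\int_{\text{1st sector}} n(\vartheta)\,(\vartheta-\vartheta_m)\,r_1(\vartheta)\,d\vartheta\Bigr|^2 + \Bigl|\int_{\text{2nd sector}} n(\vartheta)\,(\vartheta-\vartheta_m)\,r_1(\vartheta)\,d\vartheta\Bigr|^2\Bigr)\Bigr\}^{1/2} .$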


With a finer subdivision of the periods of the field index into independently adjusted sectors, a correspondingly larger number of terms stands in the curly bracket.

The similarity laws of Part I, Sect. 3b can be applied to the integrals occurring here. One finds that the expression $|r_1|_{\max}\,|W|^{-1}\left|\int n(\vartheta)\,r_1(\vartheta)\,d\vartheta\right|$ (with integration over one sector) depends, for similar arrangements, only on $n^{1/2}/M$, whereas the expression $|r_1|_{\max}\,|W|^{-1}\left|\int n(\vartheta)\,(\vartheta-\vartheta_m)\,r_1(\vartheta)\,d\vartheta\right|$ is proportional to $n^{-1/2}$ for given $n^{1/2}/M$. If angular errors are neglected for the moment, the mean value of the amplitude of the changed equilibrium orbit (strictly speaking, in the sense of the definition (II.1.4)) is therefore, for fixed $n^{1/2}/M$, proportional to $\sqrt{M}$, i.e. (for fixed $n^{1/2}/M$) to $n^{1/4}$. The same dependence holds for the angular error if what is measured is not the angle directly but the coordinates of the sector end points; for higher $n$ the sectors become shorter, and this just compensates the factor $n^{-1/2}$ mentioned above. With increasing field index, therefore, not only does the density of resonance points increase (proportional to $n^{1/2}$, cf. Eq. (I.3.20)); the amplitude of the changed equilibrium orbit also becomes larger because of the larger number of elements to be adjusted independently.

According to these considerations, Eq. (II.1.15) can be written for similar arrangements as follows (29)


E RE jsin 70 | L OE?) ae )) F< 0H n ton M ie 


(29) In subsection c it will be shown that the variation of $\left|\int_{\vartheta-2\pi}^{\vartheta}\delta f(\vartheta')\,r_1(\vartheta')\,d\vartheta'\right|$ along the circumference can be taken into account approximately by replacing here $1/|\sin\pi Q|$ by $1/|\sin\pi Q| + \varepsilon$, where $\varepsilon$ is a positive number of order of magnitude one.


73 - Supplemento al Nuovo Cimento,
The functions


|e) (7) aol 


ni & ca ‘( 


| 
= 
re 
Soe 
eel 


1. Sektor 2. Sektor 
(111.17) - RA. RE, ni 
| hsm ee a ir! an) ( | (9)(d = Voll ry (9) dd | i 
| = : 1. Sektor 2 eee 


depend here only on the characteristic quantity $n^{1/2}/M$ and thus on $\mu$. To understand the term occurring for the angular error, note further that


eee E = |e (È 2 
(1.1.18) = x SI 


with $L$ the geometric length of a periodicity region. Since the radius $R$ is, however, prescribed by the desired maximum energy, replacing $R$ by $L$ is usually not expedient.

The integrals occurring in Eq. (II.1.17) can be transformed further by using the equation of motion (E.6). One has, namely (30),

(II.1.19)    $\int n(\vartheta)\,r_1(\vartheta)\,d\vartheta = [r_1'(\vartheta)] , \qquad \int n(\vartheta)\,(\vartheta-\vartheta_m)\,r_1(\vartheta)\,d\vartheta = [(\vartheta-\vartheta_m)\,r_1'(\vartheta)] - [r_1(\vartheta)] ,$

where the two ends of the respective sector are to be inserted as limits on the right. As an example, the functions $f_{\delta\xi}(n^{1/2}/M)$, $f_{\delta\eta}(n^{1/2}/M)$ were calculated for an arrangement with sectors of equal length and a field index of equal magnitude but opposite sign (cf. Eq. (E.15)); they are shown in Fig. 7. It is not surprising that both functions tend to infinity at the upper boundary of the stability region. Mathematically this follows from the occurrence of the Wronskian in the denominator of the expressions defining the functions; the Wronskian vanishes at the boundary of a stability region, since $r_1(\vartheta)$ and $r_2(\vartheta)$ converge there to the same periodic or antiperiodic function. At $n^{1/2}/M = 0$ there is no boundary of the stability region in the mathematical sense; the fact that $f_{\delta\xi}(n^{1/2}/M)$ and some other functions occurring later become infinite there stems from the splitting off of the factor $(n^{1/2}/2\pi)^{1/2}$.

Fig. 7. - The functions $f_{\delta\xi}(n^{1/2}/M)$ and $f_{\delta\eta}(n^{1/2}/M)$, which describe the influence of statistical alignment errors (position $\delta\xi$ and angle $\delta\eta$) of the individual sectors on the displacement of the changed equilibrium orbit; cf. (II.1.16).

(30) A square bracket shall mean, here and in later equations, that the difference of the values of the function at the two limits of integration (usually the end points of a sector) is to be formed.
As one sees from the calculated functions $f_{\delta\xi}(n^{1/2}/M)$ and $f_{\delta\eta}(n^{1/2}/M)$, in this arrangement angular errors in general play no role at all compared with errors in the position of the sector midpoint. This statement is correct at any rate as long as the two end points of a sector are adjusted instead of an actual angle being measured. It is also intuitively clear that the perturbations due to such angular errors nearly compensate if the wavelength of the betatron oscillations is large compared with the sector length. If, on the other hand, one always couples together half a sector with positive field index and half a sector with negative field index and now adjusts these new units, the situation is exactly reversed. One then has relative insensitivity to errors in the position of the centre of the unit, but in return a correspondingly greater sensitivity to angular errors.

A statistical treatment of other procedures for adjusting the magnets is possible in a corresponding way; for instance, a limited number of points on the circumference may be surveyed with respect to the centre of the accelerator and the magnets themselves then adjusted relative to these auxiliary points. No essentially new points of principle arise here; the details will therefore not be entered into.

Alignment errors are not the only causes of errors in the magnetic guide field; the statistical fluctuation of the remanent fields, for example, acts in the same way. The fluctuations $\delta H(\vartheta)$ of the field strengths of the individual sectors can presumably be assumed independent of one another. It is difficult to say, however, whether the field errors of a single sector are adequately described by a $\delta H$ that is constant over the whole sector length. For very long sectors this is probably no longer correct; for shorter sectors the assumption will presumably suffice. With this assumption one obtains the following expression for the amplitude of the displaced equilibrium orbit, defined by Eq. (II.1.4), that is produced by the sector-to-sector fluctuation of the remanent fields


I "sa noi ey E (i 
(11.1.20) Cnn = Taina) E (779) tom SAR 
where the function
ni CASS (57 | | ( 3 | ) 
] ica a elms, (2° (9) ad + 
(11.1.21) fon (5) Tale | i 0) | | 


1st sector        2nd sector

depends only on $n^{1/2}/M$. Equation (II.1.20) shows that this perturbation decreases with increasing field index. This is an expression of the strong focusing. For alignment errors no corresponding factor (a negative power of $n$) appeared, because at larger field index a given displacement leads to a correspondingly larger change of the magnetic field. In particular, for an arrangement with sectors of constant field index the simple relation holds


(11/122) fou Gr) = = He vr): 


If several independent errors occur in the guide field (e.g. alignment errors and field errors proper), the corresponding expectation value of the maximum amplitude is given by the square root of the sum of the squares of the individual contributions.
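The combination rule for independent error sources can be illustrated with a short sketch. The numerical values and the function name `combine_independent_rms` are purely illustrative assumptions and do not come from the paper.

```python
import math

def combine_independent_rms(contributions):
    """Square root of the sum of squares of independent rms contributions."""
    return math.sqrt(sum(c * c for c in contributions))

# Hypothetical rms amplitudes (arbitrary units), one per independent error source.
alignment_rms = 3.0   # assumed contribution of alignment errors
field_rms = 4.0       # assumed contribution of field errors proper

total_rms = combine_independent_rms([alignment_rms, field_rms])
print(total_rms)
```

The total is dominated by the larger contribution, so reducing the smaller error source alone changes the result only slightly.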


c) Displaced equilibrium orbit, fluctuation of the amplitude along the circumference. In the preceding subsection the fluctuation along the circumference (i.e. for different values of $\vartheta$) of the absolute value of the perturbation integral over one revolution, taken from $\vartheta' = \vartheta - 2\pi$ to $\vartheta' = \vartheta$, was neglected: the root-mean-square value of this quantity was taken for $\vartheta = 2\pi$ and used to estimate the amplitude of the equilibrium orbit. The mean value obtained in this way can also be understood as follows: first an average is taken over the circumference of a single accelerator (more precisely, over the points $\vartheta = 0, \vartheta_0, 2\vartheta_0$, etc.)


0 LI) 
5: aa J 
(1.1.23) oo a Jorn (?) yao + | poro (0) SURE al 
— 27 doe—-27 


and then an average is taken over the ensemble of accelerators. One has, namely,

(II.1.24)    $\langle \overline{|F|^{2}} \rangle = \langle |F|^{2} \rangle$

($\langle\ \rangle$ = average over the ensemble, overbar = average over the circumference), because, for equal error distributions in the individual periodicity ranges, the ensemble also contains accelerators that differ from one another only by a cyclic permutation of the individual periodicity ranges with their errors.

Because an average over the circumference (Eq. (II.1.23)) was in fact computed first, the maximum value of the absolute value of the perturbation integral over one revolution, which determines the maximum amplitude, is underestimated. In this subsection an attempt will be made to estimate this fluctuation and its influence on the maximum amplitude. We restrict ourselves to accelerators in which the errors in the individual periodicity ranges are independent of one another.

UBER DEN EINFLUSS VON FEHLERN DES MAGNETISCHEN FELDES USW. 1121

The quantity that appears as a factor in Eq. (II.1.18), and that will be denoted for brevity by $a_1(\vartheta)$, can be split into two parts


Î OF (PH) 7 (8) ad! 


11.1.25 i(P) = DE SIE 2. 
; Ae W (1 — exp [— 2zi@)) 
DI d') r1(9') dd‘ a 
0 i 


wa I: w Jono aes 


The first term does not depend on the coordinate $\vartheta$; the second contains the fluctuation along the circumference. If $Q$ is close to an integer, the first term dominates by far and the fluctuations become insignificant. This agrees with the considerations at the end of subsection a. For $\vartheta = 0$ the second term vanishes, while for $\vartheta = 2\pi$ it combines with the first term to give


{6} (91) ad 


(II.1.26) a,(277) = W (= exp 2760) = exp [2771 J a.(0) 
that is, a complex number of the same absolute value as $a_1(0)$ but with a changed phase.

A complete investigation of the fluctuation term would be difficult; a simplified estimate will therefore be carried out. The deviation of the quantity $|a_1(\vartheta)|$ from the value $|a_1(0)|$ will, statistically, be greatest for $\vartheta = \pi$. The root-mean-square value of the second term is given there by


ay Mulo. 
AREA 


(11.1.27) ( | df(9) (9) ad 


pers. 


cf. Eq. (II.1.9). Strictly speaking, a more precise investigation of the correlation between the phases of the first and second terms in Eq. (II.1.25) would be required. For simplicity, however, the unfavourable assumption will be made that the two complex numbers add exactly in phase at $\vartheta = \pi$. This would mean that in Eqs. (II.1.4), (II.1.10), (II.1.15), (II.1.17), (II.1.20) for the expected maximum amplitude the factor $1/|\sin \pi Q|$ is to be replaced by $1/|\sin \pi Q| + \sqrt{2}$.

Here, however, the second term is certainly still overestimated, since $|a_1(0)|^2$ will in general be smaller than its mean value if the two terms in Eq. (II.1.25) happen to add in phase at $\vartheta = \pi$. The result is therefore that one obtains a better expectation value for the maximum amplitude of the displaced equilibrium orbit if Eq. (II.1.10) is replaced by (31)


e A | ex < ak, 1 ! 
(11.1.28 NERE | 
(I 1 ) Vmax) |W | sin 770 | Ce 


where $\varepsilon$ is a number of order unity. Note that $1/|\sin \pi Q|$ has the value one for half-integral $Q$; one will not work there, however, because of the unstable regions that occur (Part I, Sect. 2). For a value of $Q$ that differs from an integer by 1/4, $1/|\sin \pi Q| = \sqrt{2}$; the correction term $\varepsilon$ can then be quite significant.
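The relative weight of the correction term can be made concrete with a small numerical sketch. The value `eps=1.0` is only an assumed stand-in for "a number of order unity", and the function names are illustrative, not taken from the paper.

```python
import math

def resonance_factor(q):
    """Return 1/|sin(pi*q)| for a given number q of betatron oscillations per turn."""
    return 1.0 / abs(math.sin(math.pi * q))

def corrected_factor(q, eps=1.0):
    """Resonance factor with an order-one correction added (eps is an assumption)."""
    return resonance_factor(q) + eps

# Compare the plain and the corrected factor at increasing distance from an integer.
for q in (6.02, 6.1, 6.25, 6.5):
    plain = resonance_factor(q)
    print(q, round(plain, 3), round(corrected_factor(q) / plain, 3))
```

Close to an integer $Q$ the plain factor dominates and the correction is negligible; at a quarter-integer distance the correction is of the same order as the factor itself.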

It should be recalled once more that Eq. (II.1.28) does not represent a well-defined expectation value but only an estimate. Nevertheless this equation should reproduce the expected amplitude of the displaced equilibrium orbit comparatively well. Our simplified estimate has not allowed us to say anything about the dependence of the quantity $\varepsilon$ on $M$ and $Q$; only its order of magnitude (unity) could be made plausible.


d) Unstable regions. The statistical calculation of the unstable regions produced by errors in the field index (Part I, Sect. 2) involves fewer mathematical complications than the investigation of the displaced equilibrium orbit. There, as discussed in the preceding subsections, the difficulty lay in the fact that the maximum amplitude of the displaced equilibrium orbit cannot be expressed in simple closed form through the function $\delta f(\vartheta)$. In the investigation of the unstable regions one has, at least when restricting oneself to the lowest approximation of perturbation theory, only to evaluate simple integrals over the perturbation $\delta n(\vartheta)$ of the field index, in accordance with Eqs. (I.2.30), (I.2.31), (I.2.25).

First it should be noted that a calculation of the shift of the centre of an unstable region, i.e. of its distance from a working point of the associated ideal accelerator with integral or half-integral $Q$, is of no practical interest. As will be shown presently, the shift of neighbouring unstable regions is nearly the same for large $M$; what matters is only the width of the stable region available between two unstable regions.

(31) The symbol $\langle\ \rangle_{WMQ}$ is explained in Eq. (II.1.11).

The position of the centre of an unstable region is given, according to Eq. (I.2.31), by


270 


Nio 1 + 
(IT.1.29 jo® ira ae 
{ ) Lo Dt 9 iW a? DA aw | On() 7 (0) 2d9 . 


0 


When the working point is changed, the magnitude and phase of $r_1(\vartheta)$ change. In going from one resonance point to the next, both changes are small for large $M$; here the phase changes cannot have any effect (in contrast to the expression for the width, see Eq. (II.1.31)). This shows that the shifts of neighbouring unstable regions are indeed equal for large $M$.


The width of an unstable region is given, according to Eq. (I.2.30), by


(1.1.30) 00, === 


In analogy with the treatment of the displaced equilibrium orbit, we put

(II.1.31)    $N_1 = \sum_{m=1}^{M} \sigma_m \exp[2i(m-1)\mu]$

with

(II.1.32)    $\sigma_m = \int_{(m-1)\vartheta_0}^{m\vartheta_0} \delta n(\vartheta)\,\{r_1(\vartheta)\exp[-i(m-1)\mu]\}^{2}\,d\vartheta$ .


Again the equal probability of positive and negative errors is postulated,

(II.1.33)    $\langle \sigma_m \rangle = 0$ ,

and further the equality of the error distribution in the individual periodicity ranges and the statistical independence of the errors in different periodicity ranges, i.e.

(II.1.34)    $\langle \sigma_m \sigma^{*}_{m'} \rangle = \delta_{mm'}\,\langle |\sigma|^{2} \rangle$ .

With this one finds for the mean value of $|N_1|^{2}$

(II.1.35)    $\langle |N_1|^{2} \rangle = M \langle |\sigma|^{2} \rangle$

and for the width of the unstable region (31)


yi 


(I tipi! .36) dii wma T aiWl 


| O | Wmoa * 


Once again the factor $\sqrt{M}$ appears that is characteristic of the statistical independence of the individual periodicity ranges.
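The $\sqrt{M}$ law can be checked with a small Monte Carlo sketch of the sum in Eq. (II.1.31). The assumptions made here are not from the paper: the contributions $\sigma_m$ are drawn as real Gaussian numbers of unit variance, and the phase per periodicity range is taken as $\mu = 2\pi Q/M$.

```python
import cmath
import math
import random

def sample_n1(m_periods, mu, rng):
    """One realisation of N_1 = sum_m sigma_m * exp(2i(m-1)mu)."""
    total = 0j
    for m in range(1, m_periods + 1):
        sigma = rng.gauss(0.0, 1.0)  # assumed: real, zero mean, <sigma^2> = 1
        total += sigma * cmath.exp(2j * (m - 1) * mu)
    return total

def mean_square_n1(m_periods, q, samples=4000, seed=1):
    """Ensemble average of |N_1|^2 over many simulated accelerators."""
    rng = random.Random(seed)
    mu = 2.0 * math.pi * q / m_periods  # assumed phase per periodicity range
    acc = 0.0
    for _ in range(samples):
        acc += abs(sample_n1(m_periods, mu, rng)) ** 2
    return acc / samples

if __name__ == "__main__":
    for m_periods in (15, 30, 60):
        # The ratio should stay close to 1, i.e. <|N_1|^2> grows like M.
        print(m_periods, mean_square_n1(m_periods, 6.5) / m_periods)
```

Since the mean square grows like $M$, the rms value grows like $\sqrt{M}$, as for any sum of statistically independent contributions.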

The further evaluation of expression (II.1.36) depends on the cause of the error in the field index. According to the relevant subsection of the Introduction, the main candidates are individual differences between the magnets and alignment errors along the equilibrium orbit. Magnet errors will be considered first. If one again assumes that each periodicity range is divided into two sectors (cf. the remarks in connection with Eqs. (II.1.12), (II.1.13)), and if one further assumes that it is physically justified to assign a constant error $\delta n$ to each sector, while the errors of different sectors are statistically independent, one has


DLE 


(1L.1.37) CO Pwmne = | | (7,(9))?. dd | ia | II . 
| | | 


1. Sektor 2. Sektor 


Application of the similarity laws of Part I, Sect. 3b shows that for similar arrangements the following equation holds


ud / on ni 
11.1.38) OQinw wma = NEC > a (E; 
di inte : È È n /wmo lo M 
mit 
AA n} A | A 
(11.1.39) fs) = ci I) (r,(9))2a0 | + | | 
i 1. Sektor 2. Sektor 


The quantity $n$ again represents the absolute value of the field index at a definite point of the periodicity range, e.g. the maximum value. The function was computed for an arrangement with sectors of equal length and field indices of equal magnitude but opposite sign (Fig. 8). Its divergence at the upper boundary of the stability region has the same cause as for the corresponding functions in subsection b.

Fig. 8. The function of $n^{1/2}/M$ which, according to Eq. (II.1.38), gives the influence of errors proper in the field index on the width of the unstable regions.

Alignment errors along the equilibrium orbit can be due to a difference in the length of the individual sectors or to an actual alignment error. The second possibility will be considered first; its treatment is meaningful only if the individual sectors are separated by field-free sections. If a sector is displaced by the angle $\delta\vartheta$ along the equilibrium orbit, the change of the field index at a fixed point of the equilibrium orbit is

(II.1.40)    $\delta n(\vartheta) = -\dfrac{dn(\vartheta)}{d\vartheta}\,\delta\vartheta$ .


Integration by parts over the sector (and, where applicable, over the associated stray field in the field-free section) first gives, for the integrals that occur (30)

(II.1.41)    $\int \dfrac{dn(\vartheta)}{d\vartheta}\,(r_1(\vartheta))^{2}\,d\vartheta = [\,n(\vartheta)(r_1(\vartheta))^{2}\,] - 2\int n(\vartheta)\,r_1(\vartheta)\,r_1'(\vartheta)\,d\vartheta$ ,

where the first term drops out, because the limits of the range of integration (sector) lie in a field-free section ($n = 0$). Application of the equation of motion and a further integration by parts then give


ae” ag ("(9))2dd — — [(rs(9))*] 


(cf. Eq. (II.1.19)). According to the similarity laws this expression, when divided by $W$, is proportional to a power of $n$ for similar arrangements. For the contribution to the width of the unstable region produced by such alignment errors one thus obtains


a ID) 
(11.1.43) 6900) = nid8) fool 57) 


wei Wing 


where the function (30)


Il 


Lara IL E rr DE 
| iW nd (MOT 2) 


M 1. Sektor 2, Sektor 


[1.1.44 j (e 
( OL EOS ) da M 
again depends only on $n^{1/2}/M$. Incidentally, according to Eq. (II.1.18) one has
(iddio) ni od = Zar n 7 4 


i j 8 i r] a sy i a In 
where $\delta l$ is the geometric displacement of a sector and $L$ again the period length.

This function too was computed for the arrangement with two sectors of equal length and with field indices of equal magnitude but opposite sign. Strictly this is not quite meaningful, since in this arrangement the sectors are assumed to abut directly, so that independent displacements are not possible. If Eq. (II.1.44) was used nevertheless, the result is to be interpreted by assuming small field-free regions between the individual sectors that do not yet appreciably change the course of the betatron oscillations. The order of magnitude of the result is at the same time characteristic of the results to be expected for other arrangements. The function is shown in Fig. 9.

Fig. 9. The function of $n^{1/2}/M$ which, according to Eq. (II.1.43), gives the influence of tangential alignment errors of the sectors on the unstable regions.

Fig. 10. The function of $n^{1/2}/M$ which, according to Eq. (II.1.43), gives the influence of a lengthening (or shortening) of the sectors on the unstable regions.

Errors in the length of the individual sectors have a stronger influence than tangential alignment errors. Formulae similar to Eq. (II.1.43) again hold, where $\delta\vartheta$ now denotes the lengthening (or shortening) of a sector. For the arrangement on which all numerical calculations of this paper are based, the corresponding function was computed and is shown in Fig. 10.


2. Probability distributions.


a) Displaced equilibrium orbit. In this section the statistical investigations are extended to the probability distribution of several quantities. A probability distribution will first be determined for the amplitude of the displaced equilibrium orbit. Because of the difficulties already discussed (cf. end of Sect. 1a) the investigation will be restricted to the quantity $F$ (Eq. (II.1.3)). Again it is assumed that positive and negative errors are equally probable and that the errors in different periodicity ranges are independent.

In Eq. (II.1.5) the quantities $\varphi_m$ are then independent of one another. If the sum on the right-hand side of that equation is plotted in the complex plane, one obtains a chain of segments whose lengths are independent of one another and all obey the same statistical law. One thus has a typical random-walk problem in two dimensions. For the length of the vector $F$ (in the sense customary in the complex plane) one will therefore expect a two-dimensional Gaussian distribution (Rayleigh distribution) of the form

(II.2.1)    $w(|F|)\,d|F| = \dfrac{2|F|}{\langle |F|^{2}\rangle}\,\exp\!\left[-\dfrac{|F|^{2}}{\langle |F|^{2}\rangle}\right] d|F|$ .

The quantity $\langle |F|^{2}\rangle$ denotes the mean-square value of $|F|$. The probability distribution (II.2.1) is shown in Fig. 11 together with the integrated probability distribution

(II.2.2)    $W(|F|) = \int_{0}^{|F|} w(|F'|)\,d|F'| = 1 - \exp\!\left[-\dfrac{|F|^{2}}{\langle |F|^{2}\rangle}\right]$ .

The proof of the (approximate) validity of Eq. (II.2.1) will be given presently. First, however, it should be noted that according to this equation the magnitude of the values of $|F|$ to be expected is characterized quite well by the value of $\langle |F|^{2}\rangle^{1/2}_{WMQ}$ calculated in Sect. 1b. In about 60% of all cases $|F| < \langle |F|^{2}\rangle^{1/2}_{WMQ}$; in only 10% of all cases is $|F| > 1.5\,\langle |F|^{2}\rangle^{1/2}_{WMQ}$; and it is very improbable (less than 2% of all cases) to find a value of $|F|$ larger than $2\,\langle |F|^{2}\rangle^{1/2}_{WMQ}$.

Fig. 11. Differential and integrated probability distribution of the quantity $|F|$ according to Eqs. (II.2.1), (II.2.2). According to Eq. (II.2.7) the quantity $\delta Q^{(1)}_{\mathrm{inst}}$ obeys the same probability distributions.

The validity of the probability distribution will be proved by calculating the moments of $|F|$, i.e. the mean values $\langle |F|^{2k}\rangle$ of its powers, where one may restrict oneself to the even powers. It will turn out that for not too large $k$ ($k \ll M$) the moments $\langle |F|^{2k}\rangle$ agree approximately with the moments that follow from (II.2.1). The fact that the moments calculated directly and those calculated from Eq. (II.2.1) agree only for not too large $k$, and even then only approximately, means that the probability distribution (II.2.1) too is satisfied only approximately and only for not too large $|F|$. For sufficiently large $M$, however, the differences become noticeable only in a region in which both the true probability distribution and the one given by Eq. (II.2.1) are already negligibly small.
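The percentages quoted for the Rayleigh distribution can be reproduced with a few lines, assuming the integrated distribution has the form $W(|F|) = 1 - \exp[-|F|^{2}/\langle |F|^{2}\rangle]$. The function name `rayleigh_cdf` is an illustrative choice.

```python
import math

def rayleigh_cdf(x, mean_square=1.0):
    """Integrated distribution W(x) = 1 - exp(-x^2 / <x^2>) for x >= 0."""
    return 1.0 - math.exp(-x * x / mean_square)

rms = 1.0  # work in units of the root-mean-square value
below_rms = rayleigh_cdf(1.0 * rms)        # fraction with |F| below the rms value
above_1p5 = 1.0 - rayleigh_cdf(1.5 * rms)  # fraction with |F| above 1.5 times rms
above_2 = 1.0 - rayleigh_cdf(2.0 * rms)    # fraction with |F| above 2 times rms

print(f"{below_rms:.3f} {above_1p5:.3f} {above_2:.3f}")  # prints "0.632 0.105 0.018"
```

These values correspond to the "about 60%", "only 10%" and "less than 2%" stated in the text.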

The moment $\langle |F|^{2}\rangle$ was already calculated in Eq. (II.1.9). Essential use was made there of Eq. (II.1.7) (equal probability of positive and negative errors) and Eq. (II.1.8) (equal error distribution in all periodicity ranges, but statistical independence of the errors in different periodicity ranges). Generalizing Eq. (II.1.7), it will be required that all mean values with an odd number of $\varphi$ factors vanish, while in analogy with Eq. (II.1.8) one has, for example,

(II.2.3)    $\langle \varphi_l \varphi_{l'} \varphi^{*}_m \varphi^{*}_{m'} \rangle = (\delta_{lm}\delta_{l'm'} + \delta_{lm'}\delta_{l'm} - 2\delta_{ll'}\delta_{lm}\delta_{lm'})\,\langle |\varphi|^{2}\rangle^{2} + (\delta_{ll'}\delta_{mm'} - \delta_{ll'}\delta_{lm}\delta_{lm'})\,|\langle \varphi^{2}\rangle|^{2} + \delta_{ll'}\delta_{lm}\delta_{lm'}\,\langle |\varphi|^{4}\rangle$ .

With this one obtains for the moment

(II.2.4)    $\langle |F|^{4}\rangle = \sum_{l,l',m,m'=1}^{M} \exp[i(l + l' - m - m')\mu]\,\langle \varphi_l \varphi_{l'} \varphi^{*}_m \varphi^{*}_{m'} \rangle = 2M(M-1)\,\langle |\varphi|^{2}\rangle^{2} + \left(\dfrac{1 - \cos 4\pi Q}{1 - \cos 2\mu} - M\right) |\langle \varphi^{2}\rangle|^{2} + M\,\langle |\varphi|^{4}\rangle$ .

From this one can read off that approximately

(II.2.5)    $\langle |F|^{4}\rangle \approx 2M^{2}\,\langle |\varphi|^{2}\rangle^{2} = 2\,\langle |F|^{2}\rangle^{2}$

(cf. Eq. (II.1.9)), if $M \gg 1$ and if all the mean values $\langle |\varphi|^{2}\rangle^{2}$, $|\langle \varphi^{2}\rangle|^{2}$, $\langle |\varphi|^{4}\rangle$ have roughly the same order of magnitude. The quantity $(1 - \cos 4\pi Q)/(1 - \cos 2\mu)$ is always finite and tends to zero at the boundaries of a stability region. Generalizing the calculation leading to Eq. (II.2.5), one finds that

(II.2.6)    $\langle |F|^{2k}\rangle \approx k!\,M^{k}\,\langle |\varphi|^{2}\rangle^{k} = k!\,\langle |F|^{2}\rangle^{k}$

as long as $k \ll M$. This relation, however, is precisely what characterizes the probability distribution (II.2.1).
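The moment relation for $k = 2$, $\langle |F|^{4}\rangle \approx 2\langle |F|^{2}\rangle^{2}$, can be checked with a Monte Carlo sketch of the two-dimensional random walk. The assumptions are not from the paper: the step amplitudes are real Gaussian numbers of unit variance, the step phases are $\exp(im\mu)$ with $\mu = 2\pi Q/M$, and the values of $M$ and $Q$ are arbitrary examples.

```python
import cmath
import math
import random

def moment_ratios(m_periods=60, q=6.25, samples=10000, seed=7):
    """Return (<|F|^2>/M, <|F|^4>/<|F|^2>^2) for a simulated ensemble."""
    rng = random.Random(seed)
    mu = 2.0 * math.pi * q / m_periods  # assumed phase per periodicity range
    phases = [cmath.exp(1j * m * mu) for m in range(m_periods)]
    s2 = 0.0
    s4 = 0.0
    for _ in range(samples):
        # One accelerator: independent errors in each periodicity range.
        f = sum(rng.gauss(0.0, 1.0) * ph for ph in phases)
        a2 = abs(f) ** 2
        s2 += a2
        s4 += a2 * a2
    mean2 = s2 / samples
    mean4 = s4 / samples
    return mean2 / m_periods, mean4 / mean2 ** 2

if __name__ == "__main__":
    r2, r4 = moment_ratios()
    print(r2, r4)  # expected to lie near 1 and near 2 for M much larger than 1
```

A ratio near 2 for the fourth moment is the $k = 2$ case of the factor $k!$ that characterizes the Rayleigh distribution.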

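b) Unstable regions. For the quantity $\delta Q^{(1)}_{\mathrm{inst}}$ (cf. Eq. (I.2.30)), which characterizes the width of an unstable region, the calculations proceed exactly as for $F$ in the preceding subsection. Instead of Eq. (II.1.5), Eq. (II.1.31) is now to be taken as the basis, and it can again be interpreted as a two-dimensional random walk. One therefore expects the probability distribution

(II.2.7)    $w(\delta Q^{(1)}_{\mathrm{inst}})\,d(\delta Q^{(1)}_{\mathrm{inst}}) = \dfrac{2\,\delta Q^{(1)}_{\mathrm{inst}}}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\,\exp\!\left[-\dfrac{\delta Q^{(1)2}_{\mathrm{inst}}}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\right] d(\delta Q^{(1)}_{\mathrm{inst}})$ ,

cf. Fig. 11. It follows from a calculation of the moments of $|N_1|$. Corresponding to Eq. (II.2.3) one has

(II.2.8)    $\langle \sigma_l \sigma_{l'} \sigma^{*}_m \sigma^{*}_{m'} \rangle = (\delta_{lm}\delta_{l'm'} + \delta_{lm'}\delta_{l'm} - 2\delta_{ll'}\delta_{lm}\delta_{lm'})\,\langle |\sigma|^{2}\rangle^{2} + (\delta_{ll'}\delta_{mm'} - \delta_{ll'}\delta_{lm}\delta_{lm'})\,|\langle \sigma^{2}\rangle|^{2} + \delta_{ll'}\delta_{lm}\delta_{lm'}\,\langle |\sigma|^{4}\rangle$

and hence

(II.2.9)    $\langle |N_1|^{4}\rangle = 2M(M-1)\,\langle |\sigma|^{2}\rangle^{2} + \left(\dfrac{1 - \cos 8\pi Q}{1 - \cos 4\mu} - M\right) |\langle \sigma^{2}\rangle|^{2} + M\,\langle |\sigma|^{4}\rangle$ ,

where, however, $(1 - \cos 8\pi Q)/(1 - \cos 4\mu)$ vanishes exactly because $Q$ is strictly integral or half-integral. In general one finds for $k \ll M$

(II.2.10)    $\langle |N_1|^{2k}\rangle \approx k!\,M^{k}\,\langle |\sigma|^{2}\rangle^{k} = k!\,\langle |N_1|^{2}\rangle^{k}$ ,

which is characteristic of the probability distribution (II.2.7).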

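Of greater practical interest than the width of a single unstable region is the width of the stable region between two neighbouring unstable regions. Because neighbouring unstable regions are shifted by (approximately) the same amount (see the beginning of Sect. 1d), it is given by

(II.2.11)    $\delta Q_{\mathrm{stab}} = \tfrac{1}{2}\left(1 - (\delta Q^{(1)}_{\mathrm{inst}})_{Q} - (\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}\right)$ .

Here $(\delta Q^{(1)}_{\mathrm{inst}})_{Q}$ and $(\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}$ denote the widths of the unstable regions when the number of betatron wavelengths on the circumference equals $Q$ and $Q + \tfrac{1}{2}$ respectively. The probability distribution of the quantity

(II.2.12)    $\Delta = (\delta Q^{(1)}_{\mathrm{inst}})_{Q} + (\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}$

will therefore be calculated; in general it is given by (32)

(II.2.13)    $w(\Delta) = \iint w\big((\delta Q^{(1)}_{\mathrm{inst}})_{Q}, (\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}\big)\,\delta\big(\Delta - (\delta Q^{(1)}_{\mathrm{inst}})_{Q} - (\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}\big)\,d(\delta Q^{(1)}_{\mathrm{inst}})_{Q}\,d(\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}$ .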


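(32) $\delta(x)$ is the Dirac delta function.

It will be shown further below that the widths of neighbouring unstable regions are practically statistically independent of one another for large $M$, i.e. that

(II.2.14)    $w\big((\delta Q^{(1)}_{\mathrm{inst}})_{Q}, (\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}\big) = w\big((\delta Q^{(1)}_{\mathrm{inst}})_{Q}\big)\,w\big((\delta Q^{(1)}_{\mathrm{inst}})_{Q+\frac{1}{2}}\big)$ .

Here Eq. (II.2.7) is to be inserted for the probability distributions on the right-hand side, with the same $\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle$ in both cases. A simple calculation gives for the probability distribution (II.2.13) (33)

(II.2.15)    $w(\Delta) = \dfrac{\Delta}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\,\exp\!\left[-\dfrac{\Delta^{2}}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\right] + \sqrt{\dfrac{\pi}{2}}\,\dfrac{1}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle^{1/2}} \left(\dfrac{\Delta^{2}}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle} - 1\right) \exp\!\left[-\dfrac{\Delta^{2}}{2\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\right] \Phi\!\left(\dfrac{\Delta}{\sqrt{2\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}}\right)$

and for the integrated probability distribution

(II.2.16)    $W(\Delta) = \int_{0}^{\Delta} w(\Delta')\,d\Delta'$

the following result (33)

(II.2.17)    $W(\Delta) = 1 - \exp\!\left[-\dfrac{\Delta^{2}}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\right] - \sqrt{\dfrac{\pi}{2}}\,\dfrac{\Delta}{\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle^{1/2}}\,\exp\!\left[-\dfrac{\Delta^{2}}{2\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}\right] \Phi\!\left(\dfrac{\Delta}{\sqrt{2\langle \delta Q^{(1)2}_{\mathrm{inst}}\rangle}}\right)$ .

Both probability distributions are plotted in Fig. 12. The whole calculation is of course meaningful only for widths $\Delta$ that are small compared with one.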
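The distribution of the sum of two independent widths can be cross-checked numerically. The sketch below assumes that both widths follow the same Rayleigh law with mean square `s2`, writes the closed-form density of their sum in terms of the error integral (`math.erf`), and compares it with a direct numerical convolution. All function names are illustrative.

```python
import math

def rayleigh_pdf(x, s2):
    """Rayleigh density (2x/s2) exp(-x^2/s2) for x >= 0, with mean square s2."""
    return 2.0 * x / s2 * math.exp(-x * x / s2) if x >= 0.0 else 0.0

def sum_pdf_closed(delta, s2):
    """Closed-form density of the sum of two independent Rayleigh variables."""
    s = math.sqrt(s2)
    first = delta / s2 * math.exp(-delta * delta / s2)
    second = (math.sqrt(math.pi / 2.0) * (delta * delta - s2) / s ** 3
              * math.exp(-delta * delta / (2.0 * s2))
              * math.erf(delta / (math.sqrt(2.0) * s)))
    return first + second

def sum_pdf_numeric(delta, s2, steps=2000):
    """Midpoint-rule convolution of two Rayleigh densities."""
    h = delta / steps
    return h * sum(rayleigh_pdf((i + 0.5) * h, s2) * rayleigh_pdf(delta - (i + 0.5) * h, s2)
                   for i in range(steps))

def sum_cdf_closed(delta, s2):
    """Integrated distribution belonging to sum_pdf_closed."""
    s = math.sqrt(s2)
    return (1.0 - math.exp(-delta * delta / s2)
            - math.sqrt(math.pi / 2.0) * delta / s
            * math.exp(-delta * delta / (2.0 * s2))
            * math.erf(delta / (math.sqrt(2.0) * s)))

if __name__ == "__main__":
    for delta in (0.5, 1.0, 2.0):
        print(delta, sum_pdf_closed(delta, 1.0), sum_pdf_numeric(delta, 1.0))
```

Agreement between the two densities supports the closed form; the integrated distribution can be compared in the same way by summing the numerical density over $\Delta$.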

Fig. 12. Differential and integrated probability distribution for the sum $\Delta$ of the widths of neighbouring unstable regions according to Eqs. (II.2.15), (II.2.17).

(33) $\Phi(x)$ denotes the so-called error integral, $\Phi(x) = (2/\sqrt{\pi})\int_{0}^{x} e^{-y^{2}}\,dy$; cf. e.g. E. Jahnke and F. Emde: Funktionentafeln (Leipzig and Berlin, 1938).

It remains to prove the statistical independence of the widths of neighbouring unstable regions. It can be shown, somewhat more generally, that the widths of unstable regions that (at $Q$ and $Q'$) lie not too far from one another are statistically independent of one another. For this one has to prove

(II.2.18)    $\langle |N_1^{(Q)}|^{2k}\,|N_1^{(Q')}|^{2k'} \rangle = \langle |N_1^{(Q)}|^{2k} \rangle\,\langle |N_1^{(Q')}|^{2k'} \rangle$ .


It is assumed here that the individual contributions $\sigma_m$ practically do not change in going from $N_1^{(Q)}$ to $N_1^{(Q')}$ and that the difference between the two quantities stems solely from the change of the phase $\mu$. Eq. (II.2.18) will be worked out explicitly for $k = k' = 1$. Using Eq. (II.2.8) one obtains

(II.2.19)    $\langle |N_1^{(Q)}|^{2}\,|N_1^{(Q')}|^{2} \rangle = \left[M^{2} + \dfrac{1 - \cos 4\pi(Q - Q')}{1 - \cos 2(\mu - \mu')} - 2M\right] \langle |\sigma|^{2}\rangle^{2} + \left[\dfrac{1 - \cos 4\pi(Q + Q')}{1 - \cos 2(\mu + \mu')} - M\right] |\langle \sigma^{2}\rangle|^{2} + M\,\langle |\sigma|^{4}\rangle$ .

Here $(1 - \cos 4\pi(Q + Q'))/(1 - \cos 2(\mu + \mu'))$ vanishes identically, while $(1 - \cos 4\pi(Q - Q'))/(1 - \cos 2(\mu - \mu'))$ vanishes if $Q$ and $Q'$ are really different. For large $M$ the right-hand side of Eq. (II.2.19) is indeed equal to

(II.2.20)    $M^{2}\,\langle |\sigma|^{2}\rangle^{2} = \langle |N_1^{(Q)}|^{2} \rangle\,\langle |N_1^{(Q')}|^{2} \rangle$ .

One argues correspondingly for larger $k$ and $k'$.


3. Summary of Part II.


In order to estimate the perturbations to be expected in a strong-focusing synchrotron, the results derived in Part I were evaluated in this part under statistical assumptions for the various errors. In particular, for an arrangement in which each periodicity range consists of two sectors of equal length with field indices of equal magnitude but opposite sign, the results were given quantitatively in the form of curves (Figs. 7 to 10).
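For orientation, the stability of the two-sector arrangement itself can be sketched numerically. The following is only a sketch under simplifying assumptions that are not taken from the paper: the linearized motion is $r'' + n r = 0$ in one sector and $r'' - n r = 0$ in the other, each sector has angular length $\pi/M$, and there are no field-free sections. Under these assumptions the half-trace of the transfer matrix of one periodicity range is $\cos\varphi\,\cosh\varphi$ with $\varphi = n^{1/2}\pi/M$.

```python
import math

def half_trace(n, m_periods):
    """Half-trace of the transfer matrix of one periodicity range (+n sector, then -n sector)."""
    phi = math.sqrt(n) * math.pi / m_periods  # assumed phase advance within one sector
    return math.cos(phi) * math.cosh(phi)

def is_stable(n, m_periods):
    """Oscillations are bounded when the absolute value of the half-trace is below one."""
    return abs(half_trace(n, m_periods)) < 1.0

if __name__ == "__main__":
    m_periods = 30  # hypothetical number of periodicity ranges
    for n in (25, 100, 225, 400):
        print(n, math.sqrt(n) / m_periods, round(half_trace(n, m_periods), 3), is_stable(n, m_periods))
```

In this simplified model the result depends on $n$ and $M$ only through the combination $n^{1/2}/M$, which is the argument of the curves mentioned above.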

In investigating the displaced equilibrium orbit a somewhat rough estimate could not be avoided (Sect. 1a); the fluctuation along the circumference of the quantity that characterizes the amplitude of the equilibrium orbit was likewise only estimated (Sect. 1c). The properly quantitative treatment of this quantity was given in Sect. 1b, while its probability distribution in the imagined ensemble was investigated in Sect. 2a.
In investigating the unstable regions, the shift of the centres of these regions is of no interest, since neighbouring unstable regions are shifted by practically the same amount. The root-mean-square width of an unstable region was treated in Sect. 1d, the probability distribution of this quantity in Sect. 2b. Sect. 2b also examined the probability distribution for the sum of the widths of neighbouring unstable regions; from it the probability distribution that is really of interest, that of the width of the stable region between two unstable regions, is easily obtained.


It is a pleasure for the author to thank Prof. N. Bohr and Prof. W. Heisenberg for their kind interest and for stimulating discussions. The clarification of the problems treated was greatly furthered by numerous conversations with members of the Theoretical Study Group and the Proton Synchrotron Group of the European Organization for Nuclear Research (CERN) and with members of the Institute for Theoretical Physics of the University of Copenhagen, the Max Planck Institute for Physics, Göttingen, and the Physical Institute of the University of Bonn. Sincere thanks are extended to all of them here as well.

Equations of motion in linear approximation.


The equations of motion for a particle in the synchrotron given in the Introduction, Eqs. (E.2) and (E.3), are not new. From the derivations given so far (34) it is, however, not entirely clear whether the equations are valid without additional terms even when the guide field and the field index are not constant along the equilibrium orbit. It will in fact turn out that such additional terms do appear at first but cancel exactly in the final equations of motion. For this reason the following derivation of the equations of motion is perhaps not entirely without interest.


a) Neglect of the acceleration. First the effect of the accelerating voltage is neglected, so that the problem is treated fictitiously as a stationary one. Let $\mathbf{R}(t)$ be the radius vector of a possible particle orbit, which for simplicity is taken to lie in a fixed plane. For a particle of the correct energy this orbit is conveniently chosen to be the closed equilibrium orbit. The particle travels along the orbit $\mathbf{R}(t)$ with constant speed $v$, so that the time derivative is

(A.1.1)    $\dot{\mathbf{R}}(t) = v\,\mathbf{t}_0(t)$ .

The vector $\mathbf{t}_0$, which of course itself depends on the time $t$, denotes the unit tangent vector to the orbit.

A moving trihedron travelling along the orbit $\mathbf{R}(t)$ is introduced, consisting of the vector $\mathbf{t}_0(t)$, the unit vector $\mathbf{z}_0$ perpendicular to the orbit plane, and the normal $\mathbf{r}_0(t)$ lying in the orbit plane (Fig. 13). Using rectangular Cartesian coordinates $\tau, z, r$ in the moving trihedron, every point $\mathbf{R}$ can be represented in the form

(A.1.2)    $\mathbf{R} = \mathbf{R}(t) + \mathbf{t}_0(t)\,\tau + \mathbf{z}_0\,z + \mathbf{r}_0(t)\,r$ .

The following calculations are restricted to the neighbourhood of the orbit $\mathbf{R}(t)$; powers of the coordinates $\tau, z, r$ higher than the first will therefore always be dropped.

Fig. 13. The vector $\mathbf{R}$ and the axes of the moving trihedron.

(34) D. W. Kerst and R. Serber: Phys. Rev., 60, 53 (1941); N. M. Blachman and E. D. Courant: Rev. Sci. Instr., 20, 596 (1949).

For a circulating particle $\tau, z, r$ are functions of the time. Using

(A.1.3)    $\dot{\mathbf{t}}_0 = -\mathbf{r}_0\,\dfrac{v}{\rho}$ ,    $\dot{\mathbf{r}}_0 = \mathbf{t}_0\,\dfrac{v}{\rho}$

with $\rho$ as the radius of curvature of the orbit $\mathbf{R}$ at the point in question, one finds for the time derivatives of $\mathbf{R}$


feaafesetieghent seg) 
(A.1.4) | = to 2 CR] 


PA aerieneRtbe 


For later use (Eq. (A.1.13)) it is important to note that, because of the motion of the trihedron, the same fixed point in space has different coordinates at different times. From the condition

(A.1.5)    $\mathbf{R}(t + dt, \tau + d\tau, z + dz, r + dr) = \mathbf{R}(t, \tau, z, r)$

it follows, using the first of Eqs. (A.1.4), that

(A.1.6)    $d\tau = -v\,dt\left(1 + \dfrac{r}{\rho}\right)$ ,    $dz = 0$ ,    $dr = v\,dt\,\dfrac{\tau}{\rho}$ ,

which for $r, \tau \to 0$ reduces to

(A.1.7)    $d\tau = -v\,dt$ ,    $dz = dr = 0$ .

74 - Supplemento al Nuovo Cimento.
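Let the magnetic field $\mathbf{H}(\mathbf{R})$ in the neighbourhood of a point $\mathbf{R}$ be represented by

(A.1.8)    $\mathbf{H}(\mathbf{R}) = \mathbf{z}_0 H_0 + \mathbf{t}_0 (K_{11}\tau + K_{12} z + K_{13} r) + \mathbf{z}_0 (K_{21}\tau + K_{22} z + K_{23} r) + \mathbf{r}_0 (K_{31}\tau + K_{32} z + K_{33} r) + \text{quadratic terms}$ .

Here $H_0$ and the coefficients $K_{ik}$ depend only on $t$ (because of the motion of the trihedron), but not on $\tau, z, r$. For example,

(A.1.9)    $K_{22} = \dfrac{\partial H_z}{\partial z}$ ,    $K_{23} = \dfrac{\partial H_z}{\partial r}$ .

From the conditions

(A.1.10)    $\operatorname{div} \mathbf{H} = 0$ ,    $\operatorname{rot} \mathbf{H} = 0$

it follows for the coefficients $K_{ik}$ that

(A.1.11)    $K_{11} + K_{22} + K_{33} = 0$ ,    $K_{ik} = K_{ki}$ .

A magnetic field that is constant in time is represented at different times by different $H_0$, $K_{ik}$, etc. From

(A.1.12)    $\mathbf{H}(t + dt, \tau + d\tau, z + dz, r + dr) = \mathbf{H}(t, \tau, z, r)$

and Eq. (A.1.7) it follows that

(A.1.13)    $\mathbf{z}_0 \dot{H}_0 - v\,(\mathbf{t}_0 K_{11} + \mathbf{z}_0 K_{21} + \mathbf{r}_0 K_{31}) + \text{small terms} = 0$ ,

where the "small terms" are at least of first order in $\tau, z, r$. From this equation one obtains (also using Eq. (A.1.11))

(A.1.14)    $K_{12} v = K_{21} v = \dot{H}_0$ ,    $K_{11} = K_{13} = K_{31} = 0$ ,    $K_{22} + K_{33} = 0$ ,    $K_{23} = K_{32}$ .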

The vector product $\dot{\mathbf{R}} \times \mathbf{H}$ on the right-hand side of the equation of motion

(A.1.15)    $(m\dot{\mathbf{R}})^{\cdot} = \dfrac{e}{c}\,\dot{\mathbf{R}} \times \mathbf{H}$

can now be evaluated. Restricting to linear terms one finds


Ros > hx H= nf iL. + zo{_ VK ye — VK gr} + 


ery tee (1 | 3 H,t CHA Ka + Karl 


and thus, using Eq. (A.1.4) and noting that

(A.1.17)    $\dot{m} = 0$ ,

the three scalar equations of motion


fi 
Q Q 2 oO 


| 
|i=- Lett, 
ete) lass v2 r vo, 1 e r 
iL 42) ati) = (Ho(1 o 
| Q 0 0 Q Me Q 


abe Ha + H,t + Ke + Ksr)) . 


Here $m$ denotes the relativistic mass corresponding to the constant speed $v$.

Use is now made of the assumption that $\mathbf{R}(t)$ represents an orbit that can actually be traversed under the influence of the magnetic field $\mathbf{H}(\mathbf{R})$, i.e. $\tau = z = r = 0$ is to be a solution of the system (A.1.18). From the first or third equation one finds the condition (E.1)


i, pel1\: 
(A.1.19) HE. 
ce e\o 

With this the equations of motion simplify as follows

#+0(2) =o, 

È 

der Das oo | oH, oH, 
(A.1.20) EGEO at 

ALT v? | oH, ae oH, 

out ° da SL 


The speed $v$ of a particle in the magnetic field is constant in time (for the particle circulating on the equilibrium orbit, use of this was already made in Eq. (A.1.1)); in linear approximation one therefore has, as a consequence of Eq. (A.1.4),

(A.1.21)    $\dot{\tau} + v\,\dfrac{r}{\rho} = 0$ .

This equation evidently represents the first of Eqs. (A.1.20) in integrated form. This first equation can therefore be dropped, while the other two, using Eq. (A.1.21), can be written as follows


vp? È oH, a oH, 

(A.1.22) = Hi da Da ) 
ig (2 2 v? | oH, lee oH, 
r+(2) r= pa See 


If, in addition, the coordinate $\vartheta$ is introduced here in place of the time $t$ by means of the relation
0), 


(A.1.23) dd = =, dt 


then one evidently obtains Eqs. (E.2) and (E.3) of the Introduction. In magnetic lenses with $H_z = 0$, the quantity $H_{z0}$ (in the denominator) must of course be expressed through the momentum by means of Eq. (A.1.19) (35).

R. Q. Twiss (unpublished) pointed out that the field index changes for particles of deviating momentum. If the considerations are referred to an orbit traversed with a modified momentum, then, for example, in deriving the equation valid for the $z$-oscillation one divides by a somewhat different magnetic field, changed in the ratio of the momenta. The simultaneous change of the radius of curvature and of the orbit length can in many cases be neglected when the field index is high. The field indices $n_{rz}$ and $n_z$ (Eq. (E.3)) are therefore to be multiplied by $p_0/p$, where $p_0$ denotes the design momentum and $p$ the true momentum of the particle. Because of


(A.1.24) Nrz = Nar; Mps cp Ta 


this change of the field index also carries over to the quantities governing the horizontal oscillation. Because the resonance points lie close together, this effect must not be overlooked.


b) Taking the acceleration into account. — When the acceleration of the particles is taken into account, the equation of motion (A.1.15) is to be replaced by


(A.1.25) $(m\dot{\mathbf R})^{\cdot} = \frac{e}{c}\,\dot{\mathbf R}\times\mathbf H + e\,\mathbf E$.


The accelerating voltage is assumed to act tangentially to the orbit ($\mathbf E = E\,\mathbf t$);


(35) These equations of motion do not yet represent quite the most general case, since the magnetic guide field can in principle also have a component parallel to the equilibrium orbit. According to a kind private communication from Mr. E. D. COURANT, however, the equations of motion can be brought into the form given above even then, by formulating them for a co-rotating reference frame.

for a particle that is not displaced one then has


(A.1.26) $(mv)^{\cdot} = eE$.


In setting up the scalar equations of motion, the term $\dot m\dot{\mathbf R}$ must now be taken into account on the one hand, and on the other hand the speed must also be differentiated when forming $\ddot{\mathbf R}$. Taking Eq. (A.1.26) into account, one thus obtains


(A.1.27 (mz) = © (Kye + Kat) 
ne MOT ev > 
(mr 0 os: (Ko + Kg) 


The condition that the speed of the particles at every instant equals the design speed following from (A.1.26) (i.e. that Eq. (A.1.21) again holds, now with time-dependent $v$) again permits the first equation to be dropped, while the two remaining equations now read as follows


a CUM 
(A.1.28) CA IE de 


1 : H, 
(mr) = eS [Fue =e (Hs su = ) : 
¢ 0) 


Both equations are of the same type. It therefore suffices to discuss, say, the first equation. The time $t$ is again to be expressed through the coordinate $\vartheta$, where the time variation of the speed $v$ must now be taken into account. One finds


aa) 
(A.1.29) (mz) == R È Rene 
and hence

(A.1.30) 2" a — (N28 + nt) = 0; 


cf. Eq. (E.3). It is noteworthy that only the particle momentum $p$ now appears, not the mass and the velocity separately. Introducing the functions

(A.1.31) $\bar z(\vartheta) = \sqrt{p}\,z(\vartheta)$, $\bar r(\vartheta) = \sqrt{p}\,r(\vartheta)$

transforms Eq. (A.1.30) into


(A.1.32) i {(n+ +(2 ) (NE ! nal roi
An analogous equation, (A.1.33), holds for the second function.


One thus has the equations of an ordinary betatron oscillation without accelerating voltage, in which, however, the field indices are slightly changed compared with the quantities occurring in the stationary problem. The influence of these additional terms on the shape of the orbit and on the quantity $\mu$ is negligibly small for sufficiently slowly varying $p$. In this sense one may speak here of an «adiabatic treatment».

If the field index $n(\vartheta)$ did not change during the whole acceleration cycle, one could read off from Eqs. (A.1.31) and (A.1.32), (A.1.33) that the betatron oscillations are damped during the acceleration cycle strictly in inverse proportion to the square root of the particle momentum. In fact the field index will change during the acceleration cycle (among other things because of saturation of the magnets). But even then the transition from the formal solutions of Eqs. (A.1.32), (A.1.33) to the actual orbits is mediated by a factor $p^{-1/2}$ (Eq. (A.1.31)). This damping mechanism is called «adiabatic damping» and was probably first noticed by KERST and SERBER (36).
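The $p^{-1/2}$ scaling can be checked numerically. The following is only a minimal sketch, not the calculation of the paper: it assumes a single transverse coordinate obeying $(p z')' + p\,\omega^2 z = 0$ with a constant focusing strength $\omega^2$ and a linearly rising momentum $p(\vartheta) = 1 + \epsilon\vartheta$. The function name `simulate` and all parameter values are illustrative assumptions.

```python
import math

def simulate(omega=10.0, eps=0.01, theta_end=300.0, h=0.005):
    """RK4 integration of (p z')' + p*omega**2*z = 0 with p = 1 + eps*theta.

    Returns the final momentum and the final oscillation amplitude.
    The model equation and every parameter value are assumptions.
    """
    def p(theta):
        return 1.0 + eps * theta

    def rhs(theta, z, dz):
        # z'' = -(p'/p) z' - omega^2 z, written as a first-order system
        return dz, -(eps / p(theta)) * dz - omega ** 2 * z

    z, dz = 1.0, 0.0  # unit amplitude at p = 1
    steps = int(round(theta_end / h))
    for i in range(steps):
        t = i * h
        k1z, k1d = rhs(t, z, dz)
        k2z, k2d = rhs(t + h / 2, z + h / 2 * k1z, dz + h / 2 * k1d)
        k3z, k3d = rhs(t + h / 2, z + h / 2 * k2z, dz + h / 2 * k2d)
        k4z, k4d = rhs(t + h, z + h * k3z, dz + h * k3d)
        z += h / 6 * (k1z + 2 * k2z + 2 * k3z + k4z)
        dz += h / 6 * (k1d + 2 * k2d + 2 * k3d + k4d)
    amplitude = math.hypot(z, dz / omega)
    return p(steps * h), amplitude

p_end, amp = simulate()
# The product amp * sqrt(p_end) should stay close to the initial amplitude 1.
print(p_end, amp, amp * math.sqrt(p_end))
```

With these assumed values the momentum rises by a factor of four, so the amplitude should roughly halve while the rescaled amplitude stays nearly constant.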


APPENDIX 2.


On the perturbation treatment of small changes of the field index.


In Part I, Sect. 1a, a method of perturbation calculation was sketched that can be applied to the study of the influence of small changes of the field index. The detailed treatment was carried out in Part I only for the unstable regions that arise from small errors of the field index. Some further applications and mathematical supplements are presented in this appendix. Subsection a deals with the effect of changes of the field index in the interior of a stable region, where, in contrast to the application in the main part of the paper, it is assumed that the unperturbed $Q$ is not exactly integral or half-integral. Subsection b treats the effect of such changes at the boundary of a stable region, where one may think either of a region boundary of the unperturbed accelerator ($\mu$ an integral multiple of $\pi$) or of the boundary of an unstable region due to errors of the field index. Subsection c gives mathematical supplements to the perturbation calculation presented in the main part for obtaining unstable regions.


(36) D. W. KERST and R. SERBER: Phys. Rev., 60, 53 (1941).


a) The interior of a stable region. - The calculations of this and the following subsection can be applied both to a full revolution and, for a strictly periodic field index and a strictly periodic change, to a single periodicity interval. In the second case, instead of Eq. (I.2.1) one must require


(A.2.1)  7(9 +O) = exp[iZ] 79), 7.0+9) = exp[—i7] 7.(08) 
and instead of Eq. (I.2.2) one must set


| r1(9) = 0,(9) exp [iu 8/9], 7 (9) exp [17 3/05] , 
lr) = 0,(9) exp[— iu 9/9],  7:(9) = (9) exp [-- i71.9/9,] . 


To keep the following considerations sufficiently general, the notation $q$ and $\tilde q$ is introduced in Eqs. (I.2.3) to (I.2.10) in place of $Q$ and $\tilde Q$ respectively. When applying the results to a single periodicity interval one must then set


A.2.3 ei 
( ) q Dy? que Oo 


Furthermore, the respective periodicity interval (that of a single period or $2\pi$) is denoted generally by $\vartheta_0$.
The second Eq. (I.2.8), or
(A.2.4) BP") + 29) — (MA) + ATA 
= — i (P) + 29° q0 (9) + 6n(9)v, (9) 
can be solved in closed form for $v_1^{(1)}(\vartheta)$. Denoting the right-hand side for the moment by $f(\vartheta)$, one finds, as a slight generalization of Eq. (I.1.2),


(A.2.5) $v_1^{(1)}(\vartheta) = a_{11}^{(1)} w_1(\vartheta) + a_{12}^{(1)} w_2(\vartheta) + \int_0^{\vartheta} d\vartheta'\,\bigl(W(\vartheta')\bigr)^{-1} f(\vartheta')\,\{w_1(\vartheta')\,w_2(\vartheta) - w_2(\vartheta')\,w_1(\vartheta)\}$,

where the Wronskian determinant formed from two linearly independent solutions $w_1(\vartheta)$, $w_2(\vartheta)$ of the homogeneous equation satisfies

(A.2.6) $W(\vartheta) = W(0)\exp[-2iq\vartheta]$.

It is convenient to choose

(A.2.7) $w_1(\vartheta) = v_1(\vartheta)$, $w_2(\vartheta) = v_2(\vartheta)\exp[-2iq\vartheta]$;

the Wronskian determinant formed from $w_1(\vartheta)$, $w_2(\vartheta)$ is then, for $\vartheta = 0$, identical with the Wronskian determinant (E.9) and will therefore be denoted briefly by $W$. Eq. (A.2.5) thus reads


(A.2.8) (9) = al? 0,(9) + ai? va(P) exp [— 2690] + 
3 
“+ = di’ exp[2ig®]{ 217 v1 (8) + 2G q0,(0") + dn(9) 0, (8')} - 


0 


-{0,(0") (9) exp [— 2igh] — v:(9')v1 (9) exp [— 2iq9']} - 


The periodicity requirement (I.2.9), with $\vartheta_0$ in place of $2\pi$, is now to be applied. By considerations analogous to those made in connection with Eq. (I.2.13), it follows from it on the one hand that


do 
(A.2.9) Ja 249 v1 (0) + 29° q01 (9) + 6n(9') 0(9')} 0.(9') = 0, 
0 


and on the other hand, for the coefficient $a_{12}^{(1)}$,


do 

Jada 279 018) + 29% G0,(8") + 6n(9") 01 (8')} 01 (9) exp [2ia8"] 
(1) 0 - np b 
Ane LO a W(exp SE 


The coefficient $a_{11}^{(1)}$ remains undetermined; this is because any solution of the perturbed problem can be multiplied by a constant factor that tends to one as the perturbation vanishes. Analogous behaviour is well known from the higher approximations of Schrödinger's perturbation theory in quantum theory. It should be noted explicitly that Eq. (A.2.10) is meaningful only if $\exp[2iq\vartheta_0]$ differs from $+1$; there must therefore be no «degeneracy» in the sense of Part I, Sect. 2a.

From Eq. (A.2.9) the change $q^{(1)}$ of the quantity $q$ is to be determined. Applying the periodicity properties of $v_1(\vartheta)$, $v_2(\vartheta)$ and the definition of the Wronskian determinant first gives


do 
(A.2.11) | (01(9) + iqv:(9))o()dd = ac 


(a special case is the second Eq. (I.2.17)) and hence
aes 8 
~ 1 
A.2.12 MERE E (GI ee 
( ) q iW, | On(P)v1(P) vd) dd iwo, | On(P) 14 (9) r(9) dd . 
0 


If $\delta n(\vartheta)$ is regarded here in particular as a strictly periodic change of the ideal field index, i.e. if $\delta n(\vartheta)$ has the periodicity of the ideal field index, use can be made of Eq. (A.2.3) and one obtains


(A.2.13) TS ae fra )ry(P) r(P) dd .
The change in the number of betatron wavelengths around the circumference then follows in this approximation, because of Eq. (E.12), as


= Jane 
(A.2.14) Qua _ oniw | On(P) 1, (9) ro(9) dd 
0 


in full agreement with Eq. (I.2.24). That equation, however, had been derived there specifically for integral or half-integral $Q$. Incidentally, Eq. (A.2.10) also permits a simplifying transformation. One obtains


do È 
aq (710)? Jnl) ve 


A215 i 
( ) 42 iw W (exp [ 270, | =) 


From Eq. (A.2.13) a statement can be obtained about the sign of the Wronskian determinant, which has already been recognized as imaginary (37). For this purpose set


(A.2.16) $\delta n(\vartheta) = \delta k\cdot n(\vartheta)$,


i.e. $n(\vartheta)$ is multiplied everywhere by the same factor, which differs only slightly from one. The resulting integral can be transformed by using Eq. (E.6) for the free betatron oscillations and a subsequent integration by parts (Eq. (A.2.17)).


In this transformation it must be noted that the product $r_1(\vartheta)\,r_2(\vartheta)$ is periodic with period $\vartheta_0$. Because $r_2(\vartheta) = r_1^*(\vartheta)$, the integral $\int_0^{\vartheta_0} r_1'(\vartheta)\,r_2'(\vartheta)\,d\vartheta$ is certainly positive. One thus obtains the following statement: the Wronskian determinant $W$ formed from the Floquet solutions $r_1(\vartheta)$ and $r_2(\vartheta)$ is a positive quantity multiplied by the imaginary unit if the quantity $\mu$ defined for $r_1(\vartheta)$ by Eqs. (E.10) and (E.11) decreases under a uniform increase of the field index ($\delta k > 0$). Such sign considerations play a role in the investigation of those unstable regions that are caused by a twisting of the sectors relative to one another. Incidentally, one also sees from these considerations that under such a monotonic change of the field index within a stability region $\mu$ itself changes monotonically.
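The monotonic dependence of $\mu$ on a uniform scaling of the field index can be illustrated numerically. The sketch below does not use the Floquet and Wronskian formalism of the text; it assumes a hypothetical period made of one focusing section ($r'' = -k r$) and one defocusing section ($r'' = +k r$) of equal length and obtains $\cos\mu$ from half the trace of the transfer matrix. The lattice, the value `k0 = 1.0` and the function names are assumptions chosen for illustration.

```python
import math

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]],
            [a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]]]

def phase_advance(k, length=1.0):
    """Phase advance mu per period for r'' = -k r followed by r'' = +k r.

    Returns None when |trace|/2 > 1, i.e. when the motion is unstable.
    """
    s = math.sqrt(k)
    phi = s * length
    focus = [[math.cos(phi), math.sin(phi) / s],
             [-s * math.sin(phi), math.cos(phi)]]
    defocus = [[math.cosh(phi), math.sinh(phi) / s],
               [s * math.sinh(phi), math.cosh(phi)]]
    m = matmul(defocus, focus)
    half_trace = 0.5 * (m[0][0] + m[1][1])
    if abs(half_trace) > 1.0:
        return None
    return math.acos(half_trace)

k0 = 1.0  # assumed unperturbed gradient strength
for dk in (-0.02, -0.01, 0.0, 0.01, 0.02):
    # uniform scaling k -> k (1 + dk), the analogue of Eq. (A.2.16)
    print(dk, phase_advance(k0 * (1.0 + dk)))
```

For this lattice half the trace equals $\cos\varphi\cosh\varphi$ with $\varphi = \sqrt{k}\,L$, so inside the first stable region the printed phase advance varies monotonically with `dk`. Whether it rises or falls in the convention of the text depends on the sign convention for the field index.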
The calculation of higher approximations proceeds in exactly the same way. The periodicity condition yields on the one hand (by means of the equation holding for the factor of $v_1(\vartheta)$) the corresponding approximation of $q$, while from the other equation a statement about one of the expansion coefficients is obtained. In the higher approximations, too, an indeterminacy of the coefficients naturally remains. It is to be presumed that the resulting series converges for small changes $\delta n(\vartheta)$, although the region of convergence will never extend up to the boundary of the nearest unstable region.


(37) Cf. Part I, remarks preceding Eq. (I.1.4).


b) The boundary of a stable region. — At the boundary of a stable region, $q$ is again (as in Part I, Sect. 2a) to be redefined as zero, and the periodicity condition for $v_1(\vartheta)$, $v_2(\vartheta)$ is to be replaced, if necessary, by an antiperiodicity condition.

Assume first that at the boundary of the region all solutions are periodic or antiperiodic. It will turn out that then, for a uniform change of the field index according to Eq. (A.2.16), no boundary of a stable region is present. Hence, if a genuine boundary of the stability region occurs under such a change, there can be only a single periodic or antiperiodic solution there, while for all other solutions an equation of the type of Eq. (E.14) holds. This proves an assertion made at the end of the Introduction.

So let it be assumed that all solutions at the region boundary are periodic (antiperiodic). Then the calculations of Part I, Sect. 2a can be taken over without change. According to Eq. (I.2.19) one finds


(Î8n(0)r,(9r(9)49)— | fn (8) (0) 240 |? 
(A.2.18) Gor = = : 


(2riW)? 


Transformations analogous to those in Eq. (A.2.17) now show that, for $\delta n(\vartheta)$ according to Eq. (A.2.16), the numerator is never negative, by the Cauchy-Schwarz inequality. Therefore $q^{(1)}$ is real and one always has stability for small changes $\delta k$; under this change of the field index there is thus no boundary of the stable region.

Now assume that only a single periodic (antiperiodic) solution, $r_{\rm per}(\vartheta)$, exists at the region boundary. Besides this solution a second one, $r_0(\vartheta)$, is used. According to Eqs. (E.13) and (E.14),


(A.2.19) $r_{\rm per}(\vartheta + \vartheta_0) = \pm\, r_{\rm per}(\vartheta)$, $r_0(\vartheta + \vartheta_0) = \pm\,\bigl(r_0(\vartheta) + a\,r_{\rm per}(\vartheta)\bigr)$.


Let $W$ denote the Wronskian determinant formed from these two solutions. One easily convinces oneself that the quotient $a/W$ is independent of the particular choice of the second solution $r_0(\vartheta)$. If the perturbation calculation is now set up by the same scheme as before, one arrives in general (unless a special condition on $\delta n(\vartheta)$ is fulfilled) at a contradiction. A different order of magnitude must therefore be assumed in the expansion of $q$. The second and third equations of the perturbation scheme given in Eq. (I.2.8) are now to read


| dA ITA = — 241G7,,.(9) , 
BP" (8) — NOEL (I) = — 279 1, (9) — 249 BO’ (9) + 
| + 9° r0,(9) + INI) (9). 
The solution of the first equation reads


(A.2.21) OP (9) =a? 7,..(9) + aLe (9) = 


(A.2.22) cae 


where the leading correction to $q$ still remains undetermined in this approximation and results only in the next step. No second equation now follows from the periodicity requirement, since


CA) 
(A.2.23) | ri (9)r (9) dd = 0. 


0 


One coefficient naturally remains undetermined again. Simple transformations yield the following closed expression for the first correction (with a special choice of this coefficient)


(A.2.24) (0) = igo (ee) 


i Ora) 


From the second Eq. (A.2.20) one obtains
(A.2.25) (9) = ar (0) + ar (0) + 


d 
e | dS (21915 (0) 21 GOEL 0) TT reel!) + OMO) rp (9) 
0 


; (Tper(O) ro(d) rai ro(d') Yrer(0)) LI 
and from the periodicity condition one obtains the two equations


Go 


0= I (— ZITO (9) — WFO’ (0) +F pe (9) + I) yee") 
i r_(9) a", 
(A.2.26) ” per 
| a = — TAO TITO) + Tra) + 
| i + 510(9") Pyon(9")) Po) dI. 


The first equation permits the determination of the leading correction to $q$. After inserting Eq. (A.2.24) one obtains, using Eq. (A.2.23) and after some intermediate calculation,
(A.2.27) (Ie) =— | (9) (ru: (0)) 248, 


where $a/W$, as noted above, does not depend on the particular choice of the function $r_0(\vartheta)$. A further evaluation of Eq. (A.2.27) and the calculation of higher approximations will not be carried out here.


c) Unstable regions due to errors in the field index. — Some mathematical supplements are to be added to the calculations carried out in Part I, Sect. 2 for determining the unstable regions. It is to be shown that the coefficients $a_{11}^{(1)}$ and $a_{12}^{(1)}$, which had remained undetermined (cf. Eq. (I.2.13)), and the second approximation of $Q$ can really be obtained from the second approximation of the perturbation theory. For this purpose $Q^{(1)} \neq 0$ is assumed. The equation determining $v_1^{(2)}(\vartheta)$ reads


(A.2.28) FE) — n(B) BEB) = — QOBP' (9) + dn (IT (9) — 
= VBQOBO” (9) <i QTA (I) 


It is noteworthy that the function $v_1^{(1)}(\vartheta)$, whose coefficients $a_{11}^{(1)}$ and $a_{12}^{(1)}$ are to be determined, appears on the right-hand side with the same factors as the function $v_1^{(0)}(\vartheta)$ in Eq. (I.2.8) (with $Q^{(0)} = 0$). One easily sees in general that the function $v_1^{(n)}(\vartheta)$ always appears in the equation for the $(n+1)$th approximation in the form $-2iQ^{(1)} v_1^{(n)\prime}(\vartheta) + \delta n(\vartheta)\,v_1^{(n)}(\vartheta)$; the determination of the coefficients $a_{11}^{(n)}$, $a_{12}^{(n)}$ therefore proceeds in all further steps by the same scheme. The contribution $Q^{(n+1)}$ to $Q$ that is to be determined in the respective approximation, on the other hand, always appears in the form $-2iQ^{(n+1)} v_1^{(0)\prime}(\vartheta)$.

On the right-hand side of Eq. (A.2.28), $v_1^{(0)}(\vartheta)$ and $v_1^{(1)}(\vartheta)$ are now to be inserted from Eqs. (I.2.12), (I.2.13), where only that part which contains $Q^{(2)}$ or the coefficients $a_{11}^{(1)}$, $a_{12}^{(1)}$ is written out explicitly:


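(A.2.29) $v_1^{(2)\prime\prime}(\vartheta) - n(\vartheta)\,v_1^{(2)}(\vartheta) = -2iQ^{(1)}\bigl(a_{11}^{(1)} r_1'(\vartheta) + a_{12}^{(1)} r_2'(\vartheta)\bigr) + \delta n(\vartheta)\bigl(a_{11}^{(1)} r_1(\vartheta) + a_{12}^{(1)} r_2(\vartheta)\bigr) - 2iQ^{(2)}\bigl(a_{11}^{(0)} r_1'(\vartheta) + a_{12}^{(0)} r_2'(\vartheta)\bigr) + \dots$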


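As the solution one obtains


(A.2.30) $v_1^{(2)}(\vartheta) = a_{11}^{(2)} r_1(\vartheta) + a_{12}^{(2)} r_2(\vartheta) + \frac{1}{W}\int_0^{\vartheta} d\vartheta'\,\bigl[-2iQ^{(1)}\bigl(a_{11}^{(1)} r_1'(\vartheta') + a_{12}^{(1)} r_2'(\vartheta')\bigr) + \delta n(\vartheta')\bigl(a_{11}^{(1)} r_1(\vartheta') + a_{12}^{(1)} r_2(\vartheta')\bigr) - 2iQ^{(2)}\bigl(a_{11}^{(0)} r_1'(\vartheta') + a_{12}^{(0)} r_2'(\vartheta')\bigr) + \dots\bigr]\cdot\bigl(r_1(\vartheta')\,r_2(\vartheta) - r_2(\vartheta')\,r_1(\vartheta)\bigr)$;


the coefficients $a_{11}^{(2)}$, $a_{12}^{(2)}$ are to be determined from the third approximation. The periodicity condition (I.2.9) yields the equations (Eq. (I.2.17) has been used)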


(1) 1 TIT) Sav Pave 
(4.2.31) | ari Nar +12 (Na — 221WQ) = 2riWQDa® + ..., 
side 
1) TN) 7 VA) 
| aii(Nis + 2rri WQ®) + ays Noe = — 20iWQa +... 


Again only those parts that contain $Q^{(2)}$ or $a_{11}^{(1)}$, $a_{12}^{(1)}$ are given explicitly here. The coefficients of the homogeneous part of this system of equations are identical with those that appeared in the system (I.2.18) for $a_{11}^{(0)}$, $a_{12}^{(0)}$. A corresponding situation recurs in all higher approximations. From this it follows first that to a solution $a_{11}^{(1)}$, $a_{12}^{(1)}$ of Eq. (A.2.31) one can add an arbitrary multiple of $a_{11}^{(0)}$, $a_{12}^{(0)}$; this is again the indeterminacy in the coefficients of the higher approximations that has already been met several times and whose reason was given in subsection a.

The determinant formed from the coefficients of the homogeneous part of Eq. (A.2.31) vanishes; in the first approximation this was precisely the condition for determining $Q^{(1)}$. The two Eqs. (A.2.31) are therefore in general not compatible with each other at all. From the condition of compatibility $Q^{(2)}$ can be determined. Without carrying out the calculation in detail, it will be shown that in this way one really arrives at a unique value of $Q^{(2)}$. Since $Q^{(n)}$ appears in the $n$th approximation in the same way as $Q^{(2)}$ in the present second approximation, this shows that no new difficulties arise in higher approximations either.

Denote the right-hand sides of the system of equations (A.2.31) by $B_1$, $B_2$, i.e.
(A232) B, = 201 WQ? a” +... , B, =  2riWQVa® + 


The solvability condition for Eq. (A.2.31) then reads


{ Bi(N + 2riWQ®)— BN, = 0, 


(A.2.33) ie 
| Bay, — B.(Na— 2a WE) = 0 


The two equations are linearly dependent; one of them may, however, under certain circumstances vanish identically. Assume, say, that the quantities $N_{22}$ and $N_{21} - 2\pi i W Q^{(1)}$ are not both equal to zero; the relevant zeroth-order coefficient $a^{(0)}$ is then also different from zero. The second condition (A.2.33) is now to be used:


(A.2.34) 0 = BLN» — B(Na— ni WQ) = 
= Ii WO (a No» + a ( No, — 201 WQ)) + 


Using the first Eq. (I.2.18) and noting that $N_{12} = N_{21}$, one thus obtains


(A.2.35) $0 = -2(2\pi i W)^2\,Q^{(1)} Q^{(2)} a^{(0)} + \dots$,

where the omitted terms indicated by dots represent well-determined expressions that no longer contain $Q^{(2)}$, $a_{11}^{(1)}$, $a_{12}^{(1)}$. Because of the assumptions $Q^{(1)} \neq 0$, $a^{(0)} \neq 0$, Eq. (A.2.35) can be solved uniquely for $Q^{(2)}$. The coefficients $a_{11}^{(1)}$, $a_{12}^{(1)}$ can subsequently be obtained from one of Eqs. (A.2.31).

The series expansion obtained in this way fails in the immediate neighbourhood of the boundary of the unstable region. If one wishes to calculate the position of this region boundary in higher approximations, beyond Eq. (I.2.28), one must proceed in an essentially different way. This will, however, not be gone into here.


1. — Formulation of the Problem.


It is well known that the factorial moments can be expressed by means 
of the Mellin transform (1) e.g. the first moment 


ao+ito 


i IS 
(1) Se org fas “i exp [w(S)a] . 


But this integral form of the solution is not suitable for some considerations.

In our paper we investigate the behaviour of the moments for large depth of the absorber. We take advantage of the equations satisfied by these moments. No use, however, is made of expressions like (1).

We confine ourselves to the case of a nucleon cascade in homogeneous nuclear matter. The results may be generalized to the case of a nucleon cascade in a finite absorber and to the electron-photon cascade.

(1) See e.g. H. MESSEL: Proc. Roy. Irish Acad., A 54, 125 (1951); Proc. Phys. Soc., A 64, 726 (1951), A 65, 465 (1952).

Let us denote the k-th factorial moment in the cascade theory formalism of JANOSSY (2) by $S_k(\varepsilon, x)$, i.e.


(2) $S_k(\varepsilon, x) = \sum_{n=k}^{\infty} n(n-1)\cdots(n-k+1)\,P(n|\varepsilon, x)$,


where $P(n|\varepsilon, x)$ denotes the probability that at depth $x$ there are $n$ nucleons of energy $> \varepsilon$ due to one primary nucleon of energy 1.
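Definition (2) is straightforward to evaluate for any discrete distribution. The following minimal sketch computes factorial moments from a list of probabilities; the Poisson distribution used as input is purely an illustrative assumption (it is not the cascade distribution), chosen because its $k$-th factorial moment is known to be $\lambda^k$. The function name `factorial_moment` is hypothetical.

```python
import math

def factorial_moment(probs, k):
    """k-th factorial moment, sum over n >= k of n(n-1)...(n-k+1) P(n).

    probs[n] is the probability of finding exactly n particles.
    """
    total = 0.0
    for n, p in enumerate(probs):
        if n >= k:
            falling = 1
            for j in range(k):
                falling *= n - j  # n (n-1) ... (n-k+1)
            total += falling * p
    return total

lam = 0.7  # assumed Poisson mean, for illustration only
poisson = [math.exp(-lam) * lam ** n / math.factorial(n) for n in range(60)]
for k in range(4):
    print(k, factorial_moment(poisson, k), lam ** k)
```

For $k = 0$ the sum reduces to the normalization of the distribution, and $k = 1$ gives the mean number of particles, the first moment $S_1$.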

We present the solution of a question suggested to us by LOPUSZANSKI.
We have proved that for $0 < \varepsilon < 1$ and $k = 1, 2, 3, \dots, [\varepsilon^{-1}]$


(3a) $S_k(\varepsilon, x) = e^{-x}\,W_k(\varepsilon, x)$,


where for each $\beta > 0$


(3b) $\lim_{x\to\infty} e^{-\beta x}\,W_k(\varepsilon, x) = 0$

and

(3c) $\lim_{x\to\infty} W_k(\varepsilon, x) = \infty$.


The applications of (3) are given by LOPUSZANSKI in two subsequent papers (3).


2. — The Proof. 


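In order to demonstrate (3) it is sufficient to show that for $0 < \varepsilon < 1$ and $k = 1, 2, 3, \dots, [\varepsilon^{-1}]$


(4a) $\lim_{x\to\infty} e^{\lambda x}\,S_k(\varepsilon, x) = \infty$ for $\lambda = 1$,
(4b) $\lim_{x\to\infty} e^{\lambda x}\,S_k(\varepsilon, x) = 0$ for $\lambda < 1$.


From the integral G-equation of JANOSSY (2) we get the integral equation satisfied by the factorial moments for $0 < \varepsilon < 1$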


(5) Spe, 6) = fera ffne, aye 


k k € 
do (") 8,(5; 6) SEI (5 5) de’ de” dé _ Oke-* , 
where $S_0(\varepsilon, x) = 1$.


(2) L. JANOSSY: Proc. Phys. Soc., A 63, 241 (1950); A 66, 117 (1953).
(3) J. LOPUSZANSKI: see this issue, pag. 1150, 1161.

To prove (4a) we multiply both sides of (5) by $e^{x}$; consequently the right-hand side of (5) depends on $x$ only through the upper limit of integration. In this equation the integrands are positive, and therefore it follows that $W_k(\varepsilon, x)$ is an increasing function of $x$. It follows from (5) that

dt x 


Wi(e,2) > | | w(e’, e") de": | w (5, s)de de’ , 


oe 


0 0 0 
and the right-hand side of this inequality tends to infinity when $x$ tends to infinity.

From the definition (2) we get 


S,(8, x) < Le 8a (e, x) 


consequently it is sufficient to show (4b) for k=1 only. 
Let us perform a transformation of $S_1(\varepsilon, x)$


1 : es 
VI) =e [ Sule, ajerde, 


0 


where 7 is a sufficiently large natural number and /4< 1. 
We show that 


(6) lim M(x) 207 pie 


aw—> oo 
From (5) we get the equation for $M(x)$


$M(x) + \int_0^x K(x - \xi)\,M(\xi)\,d\xi = C(x)$,

where

$\int_0^{\infty} |K(x)|\,dx < \infty$,

$\lim_{x\to\infty} C(x) = 0$

and the Laplace transform of $K(x)$ is different from $(-1)$ in the right-hand half-plane. According to the theorem of PALEY and WIENER (*), (6) follows immediately and this completes the proof.
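The behaviour asserted by the theorem can be observed on a simple example. The sketch below solves the Volterra equation numerically with the trapezoidal rule. The kernel $K(x) = e^{-x}$ and the right-hand side $C(x) = e^{-x/2}$ are assumptions chosen for illustration and are not the functions of the proof; this kernel is integrable and its Laplace transform $1/(s+1)$ never equals $-1$ for $\mathrm{Re}\,s \ge 0$. For this choice the exact solution is $M(x) = \tfrac{1}{3}e^{-x/2} + \tfrac{2}{3}e^{-2x}$, which tends to zero.

```python
import math

def solve_volterra(kernel, rhs, x_max, steps):
    """Solve M(x) + int_0^x K(x - s) M(s) ds = C(x) on a uniform grid.

    Uses the trapezoidal rule; kernel and rhs are callables K(x) and C(x).
    """
    h = x_max / steps
    xs = [i * h for i in range(steps + 1)]
    kv = [kernel(x) for x in xs]  # K evaluated at the grid distances
    m = [rhs(0.0)]                # the integral vanishes at x = 0
    for i in range(1, steps + 1):
        acc = 0.5 * kv[i] * m[0]
        for j in range(1, i):
            acc += kv[i - j] * m[j]
        # the unknown M(x_i) enters the quadrature with weight h/2 * K(0)
        m.append((rhs(xs[i]) - h * acc) / (1.0 + 0.5 * h * kv[0]))
    return xs, m

xs, m = solve_volterra(lambda x: math.exp(-x), lambda x: math.exp(-0.5 * x), 20.0, 1000)
exact = [math.exp(-0.5 * x) / 3 + 2 * math.exp(-2 * x) / 3 for x in xs]
print(max(abs(a - b) for a, b in zip(m, exact)), m[-1])
```

The first printed number is the largest deviation from the exact solution on the grid and the second is the numerical value of $M$ at the largest depth.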
A more detailed paper concerning the same problem will be published in Studia Mathematica.


1. — Introduction.


In our considerations we confine ourselves, for the sake of simplicity, to the case of a nucleon cascade in homogeneous nuclear matter. The results given below may be generalized to the case of a nucleon cascade in a finite absorber and of an electron-photon cascade.

The development of the nucleon cascade is described by JANOSSY's G-equation (1):


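(1) $\frac{\partial G(\varepsilon, u, x)}{\partial x} + G(\varepsilon, u, x) = \int_0^1\!\!\int_0^1 w(\varepsilon', \varepsilon'')\,d\varepsilon'\,d\varepsilon''\;G(\varepsilon/\varepsilon', u, x)\,G(\varepsilon/\varepsilon'', u, x)$,

where

$G(\varepsilon, u, x) = \sum_{n=0}^{\infty} P(n|\varepsilon, x)\,u^n$

is the generating function for the probabilities $P(n|\varepsilon, x)$ to find $n$ nucleons of the cascade with energy $> \varepsilon$ at the depth $x$, provided that the cascade is due to one nucleon of energy 1, and $w(\varepsilon', \varepsilon'')$ is the cross-section for a nucleon-nucleon collision in which the secondary nucleons receive the $\varepsilon'$-th and $\varepsilon''$-th part of the energy of the incident nucleon. We have further


$w(\varepsilon', \varepsilon'') \ge 0$,
$w(\varepsilon', \varepsilon'') = 0$ for $\varepsilon' + \varepsilon'' > 1$,
$\int_0^1\!\!\int_0^1 w(\varepsilon', \varepsilon'')\,d\varepsilon'\,d\varepsilon'' = 1$.


(1) L. JANOSSY: Proc. Phys. Soc., A 63, 241 (1950).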


The solution of equation (1) in the form of a recurrence formula reads, for $\varepsilon > 1/n$ (2),


(2) Ife, u, ©) = (1 — u)S,(e, 2) Je] E _ 8; oe i) 
ds niet «SE | 
(e) 
ti : 
where


$\Gamma_n(\varepsilon, u, x) = 1 - G(\varepsilon, u, x)$


for $\varepsilon > 1/n$ and $S_1(\varepsilon, x)$ denotes the first moment of the distribution. We outline the derivation of this formal solution (2) in the Appendix.
We denote $\lim_{n\to\infty} \Gamma_n(\varepsilon, u, x)$ by $\Gamma(\varepsilon, u, x)$. The relation $\Gamma(\varepsilon, u, x) = 1 - G(\varepsilon, u, x)$ is valid for all positive $\varepsilon$. Then (2) goes over into an integral equation for $\Gamma(\varepsilon, u, x)$


@ Pomo 0 mae — [ag [(— A 


0 0 


Jef. aa re, U, e)F (1,08). 
Di "n e 


Equation (3) is equivalent to the integral equation given by JANOSSY. It is suitable for our purposes.
It is very tedious to get numerical results from solution (2). We show, however, in the next paper that for large depth of the absorber we are able to derive from this solution relatively simple expressions for the distribution function. To this end we must consider two problems, which are dealt with in this note.


(2) J. LOPUSZANSKI: Acta Phys. Pol., 12, 156 (1953); 14, 191 (1955).


2. — Formulation of the problem. 


It follows immediately from (3) that for each threshold energy $\varepsilon > 0$ the cascade must die out. Namely, we have


$0 \le \lim_{x\to\infty} \Gamma(\varepsilon, u, x) \le (1 - u)\lim_{x\to\infty} S_1(\varepsilon, x) = 0$


(with respect to the last step see (3)), consequently


$\lim_{x\to\infty} G(\varepsilon, u, x) = 1$

and this yields the required result.

It is, however, interesting to know whether the probabilities $P(n|\varepsilon, x)$ for $n > 0$ tend to zero equally fast or not. To investigate this problem we introduce the conditional probabilities

$P^*(n|\varepsilon, x) = \frac{P(n|\varepsilon, x)}{\sum_{m=1}^{\infty} P(m|\varepsilon, x)} = \frac{P(n|\varepsilon, x)}{\Gamma(\varepsilon, 0, x)}$


to find $n$ nucleons of energy $> \varepsilon$ at depth $x$, provided at least one nucleon of energy $> \varepsilon$ exists at depth $x$.
We show that


(4) $\lim_{x\to\infty} \frac{\Gamma(\varepsilon', 0, x)}{\Gamma(\varepsilon, 0, x)} = 0$

for $\varepsilon' > \varepsilon > 0$ and

(5) $\lim_{x\to\infty} P^*(n|\varepsilon, x) = 0$ for $n \ge 2$.


The necessary and sufficient condition that (5) should occur is


(6) $\lim_{x\to\infty} S_1^*(\varepsilon, x) = \lim_{x\to\infty} \frac{S_1(\varepsilon, x)}{\Gamma(\varepsilon, 0, x)} = 1$.


(3) K. URBANIK: see this issue, pag. 1147.

This becomes evident if we take into account that


(7) $S_1^*(\varepsilon, x) = 1 + \sum_{n=2}^{[\varepsilon^{-1}]} (n - 1)\,P^*(n|\varepsilon, x)$.


The number of terms on the right-hand side of (7) is finite, because for each $\varepsilon > 0$ all $P^*(n|\varepsilon, x) = 0$ for $n > [\varepsilon^{-1}]$. Further, all these terms are non-negative.
Let us therefore deal with (6).
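Identity (7) follows from $\Gamma(\varepsilon, 0, x) = 1 - P(0|\varepsilon, x)$ and can be verified on any finite distribution. The sketch below does so for a small set of made-up probabilities; the numbers and the function name `conditional_distribution` are hypothetical and serve only as an illustration.

```python
def conditional_distribution(probs):
    """Return P*(n) = P(n) / sum_{m>=1} P(m) for n >= 1.

    probs[n] is P(n); the denominator equals Gamma(eps, 0, x) = 1 - P(0).
    """
    gamma0 = 1.0 - probs[0]
    return {n: p / gamma0 for n, p in enumerate(probs) if n >= 1}

probs = [0.90, 0.07, 0.02, 0.01]  # hypothetical P(0), ..., P(3), not data from the paper
star = conditional_distribution(probs)

s1 = sum(n * p for n, p in enumerate(probs))   # first moment S_1
s1_star = s1 / (1.0 - probs[0])                # S_1 / Gamma(eps, 0, x)
rhs = 1.0 + sum((n - 1) * p for n, p in star.items() if n >= 2)
print(s1_star, rhs)
```

Both printed numbers agree, and the excess of $S_1^*$ over 1 is carried entirely by the terms with $n \ge 2$, which is why (6) is equivalent to (5).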


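3. — Estimation of $\Gamma(\varepsilon, 0, x)$.

In paper I it was shown that

(8a) $S_1(\varepsilon, x) = e^{-x}\,W_1(\varepsilon, x)$,

where for each $\beta > 0$

(8b) $\lim_{x\to\infty} e^{-\beta x}\,W_1(\varepsilon, x) = 0$

and

(8c) $\lim_{x\to\infty} W_1(\varepsilon, x) = \infty$

for $1 > \varepsilon > 0$. One may extend this result to the function $\Gamma(\varepsilon, u, x)$. For our purposes it is, however, sufficient to show that

(9a) $\Gamma(\varepsilon, 0, x) = e^{-x}\,V(\varepsilon, x)$,

where for each $\beta > 0$

(9b) $\lim_{x\to\infty} e^{-\beta x}\,V(\varepsilon, x) = 0$

and

(9c) $\lim_{x\to\infty} V(\varepsilon, x) = \infty$.

To show (9) we notice that

(10) $\Gamma(\varepsilon, 0, x) \le S_1(\varepsilon, x) \le [1/\varepsilon]\,\Gamma(\varepsilon, 0, x)$.

From (8), (9) and (10) we have

(11) $0 \le \lim_{x\to\infty} e^{-\beta x}\,V(\varepsilon, x) \le \lim_{x\to\infty} e^{-\beta x}\,W_1(\varepsilon, x) = 0$

and further

(12) $[1/\varepsilon]\lim_{x\to\infty} V(\varepsilon, x) \ge \lim_{x\to\infty} W_1(\varepsilon, x) = \infty$.

(11) and (12) complete the proof of (9).


4. — Energetic relations.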


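According to (1), $\Gamma(\varepsilon, 0, x)$ satisfies the following equation,

(13) $\frac{\partial \Gamma(\varepsilon, 0, x)}{\partial x} + \Gamma(\varepsilon, 0, x) = \int_0^1 w(\varepsilon')\,d\varepsilon'\,\Gamma(\varepsilon/\varepsilon', 0, x) - \int_0^1\!\!\int_0^1 \Gamma(\varepsilon/\varepsilon', 0, x)\,\Gamma(\varepsilon/\varepsilon'', 0, x)\,w(\varepsilon', \varepsilon'')\,d\varepsilon'\,d\varepsilon''$,

with

$w(\varepsilon') = \int_0^1 [\,w(\varepsilon', \varepsilon'') + w(\varepsilon'', \varepsilon')\,]\,d\varepsilon''$.

Using the notation of (9a), equation (13) takes the form

(14) $\frac{\partial V(\varepsilon, x)}{\partial x} = \int_0^1 w(\varepsilon')\,d\varepsilon'\,V(\varepsilon/\varepsilon', x) - e^{-x}\int_0^1\!\!\int_0^1 V(\varepsilon/\varepsilon', x)\,V(\varepsilon/\varepsilon'', x)\,w(\varepsilon', \varepsilon'')\,d\varepsilon'\,d\varepsilon''$.

Let us divide both sides of (14) by $V(\varepsilon, x)$.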
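Since

$\frac{1}{V(\varepsilon, x)}\int_0^1\!\!\int_0^1 V(\varepsilon/\varepsilon', x)\,V(\varepsilon/\varepsilon'', x)\,e^{-x}\,w(\varepsilon', \varepsilon'')\,d\varepsilon'\,d\varepsilon'' \le \int_0^1\!\!\int_0^1 d\varepsilon'\,d\varepsilon''\,V(\varepsilon/\varepsilon', x)\,e^{-x}\,w(\varepsilon', \varepsilon'')$,

the last term on the right-hand side of (14) tends to zero as $x$ tends to infinity.
We now make the assumption that the limit of

$\frac{1}{V(\varepsilon, x)}\,\frac{\partial V(\varepsilon, x)}{\partial x}$

exists as $x$ tends to infinity. We justify this assumption in the following manner. There are only two possibilities: either the limit exists or it does not. We consider the first case, because it seems to us physically more probable. We denote this limit by

$A = \lim_{x\to\infty} \frac{1}{V(\varepsilon, x)}\,\frac{\partial V(\varepsilon, x)}{\partial x}$.

In view of (9c), $A$ cannot be negative. On the other hand, it follows from (9b) that $A$ cannot be positive either. Consequently it must be equal to 0. In this way our equation (14) reduces to

(15) $\lim_{x\to\infty} \int_0^1 w(\varepsilon')\,d\varepsilon'\,\frac{V(\varepsilon/\varepsilon', x)}{V(\varepsilon, x)} = 0$.

A sufficient but not necessary condition for the limit and integration operators to commute is the uniform convergence of

$\frac{V(\varepsilon/\varepsilon', x)}{V(\varepsilon, x)}$

as $x$ tends to infinity. Our expression does not fulfil this condition. Nevertheless we are able to show that the interchange of operators is permissible. We get from (15)

$\int_0^1 w(\varepsilon')\,d\varepsilon'\,\lim_{x\to\infty} \frac{V(\varepsilon/\varepsilon', x)}{V(\varepsilon, x)} = 0$

and finally, making use of the fact that $\Gamma(\varepsilon, 0, x)$ is a nonincreasing function of $\varepsilon$,

$\lim_{x\to\infty} \frac{\Gamma(\varepsilon', 0, x)}{\Gamma(\varepsilon, 0, x)} = 0$

for $\varepsilon' > \varepsilon > 0$. We have the result stated.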


5. — Probability distribution function. 


We show now that (4) yields (6).
To this end we must make use of our integral equation (3). We put $u = 0$ and divide it by $\Gamma(\varepsilon, 0, x)$. We get
he il 


(16) 1—= S#(e, 2) — ag | ay(— 


0 0 


Os, (1) > 
On 


"f fe e'\de'de” Fe/e’, 0, €)L(e/e', 0, È 
{fole 2) ne 6 P(e, 0, 0) 


Let us consider the second term on the right-hand side of (16). We show that this expression tends to zero as $x$ tends to infinity and in this way complete the proof.
We write this term in the form


x 


rfae Sone Mln. p(£ (E08 
7 n Cee a — = È are ), E 0, È 
(17) Dy, €, & )| ae Ie, 0, ©) i e ( 5) 1 2 


0 


where D(y, ¢, €) denotes the operator 


Using the notation of (8) and (9) the term (17) goes over in 


ce) iT Sey ; _— OW,(n, & —&)[on (E 
(18) DM, &', € | dé Hani 3 (=, 5)J ‘lke oe ges 
0 
It follows from the differential equation satisfied by — 0W,(y, #)/07 that 
— 0W,(n, «)/0n is a nondecreasing function of increasing 7. This equation 
reads 
1 
=(— Wi 22) a E de' | =» OW. AVILE =) 
Ox Ot DO o o(n/e') 
0 
Consequently 
— 0W,(n, @)/ en È 
19 18) < Din e (e, (E, ere. 
(19) (18) < Dy, 8,8) Tio) n EV (ia, E) esa 
Let us estimate the integral 
. € f € / 
0 


ZEN NG 7 ‘E a E 7 € eat 
< 3 Max (1 (5, 5) e-53) Max Ù + 5) e-sh) 
Ne BS oa \ E 0<E<Z € 


and the last expression is bounded because of (9b). Due to this circumstance 
the right-hand side of (19) vanishes for « tending to infinity, if 


. {— Wim &)/en | 
ele, ae vel eee 


We take into aecount that 


J Sul &) | _ 
ep \e. 0, ae 
for $\eta > \varepsilon$ occurs because of (4). In order to show that the relations


SUA wv) = 0S, (17, w)/ 07 
lim ) | nS 
Te, 0, e) J “ia 


20 ) 
(20) dim nya = 


0 
> 


hold, it was sufficient to show that 


Si(q, € x) 
Ie, 0, x)’ 


converges for $x$ tending to infinity, has a continuous derivative with respect to $\eta$,
and that this derivative converges uniformly for $x$ tending to infinity. The
dangerous places of expression (20) lie in the neighbourhood of Hip Sk Le 
loan; a 


OW = 1, x) 
CH 


shows a Dirac-delta singularity for all $x$. But we are able to overcome this
difficulty. We only outline the proof, because this matter is of interest to
a mathematician rather than to a physicist. We divide

Wi, ©) 

Vie, Oe) 


into two parts: the continuous one


Wm 2) 
Vie, 0, 2) 


and a step function 


Wi°(y, a) 1 fel 
— = 4 ax(@)H(Mn —n, 

Ve, 0, x) V DI x:( ) ( / Tic) 

where H(z) denotes the Heaviside’s function. We consider these two functions 

separately. We put 

OW, 0 4%, 0 WwW? 

On i On on 

into the right-hand side of (19). As regards the singular part, we are able to perform the integration with respect to $\eta$ and show that this part vanishes for $x$ tending to infinity. We notice that the number of singularities is finite for each $\varepsilon > 0$, namely $\le [1/\varepsilon]$. The continuous function


OWN, &)/ On 


V(e, 2) 


leads to no difficulties: (20) holds for it. 
For $\eta < \varepsilon' + \varepsilon''$, as well as for $\varepsilon' < \varepsilon$ and $\varepsilon'' < \varepsilon$, expression (17) vanishes; consequently we employ in (18), as well as in the right-hand side of (19), only the values $\eta > 2\varepsilon$.

Let $x$ in formula (19) tend to infinity. Then


co 


ae iin ee ETI lee eee r(5 s)e taé— 0 
Dn, E a È pi Vie, x) ° | le Ss sl? Ss 5 3 


and we have the result stated. 
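In this way we have completed the proof of (5) and we can finally write, for large $x$,

$$
\begin{cases}
P(0\,|\,\varepsilon, x) \sim 1, \\
P(1\,|\,\varepsilon, x) \sim S_1(\varepsilon, x), \\
P(n\,|\,\varepsilon, x) = o\big(P(1\,|\,\varepsilon, x)\big) \quad \text{for } n > 1.
\end{cases}
\tag{21}
$$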


6. — Conclusions. 


We have proved (4) and (5). $\Gamma(\varepsilon', 0, x)/\Gamma(\varepsilon, 0, x)$ denotes the conditional
probability that we find at least one nucleon of energy $> \varepsilon'$, provided that
there is at least one nucleon of energy $> \varepsilon$, where $\varepsilon' > \varepsilon$, at depth $x$.

Let us now consider the physical meaning of these two theorems. 

The relation (4) states: for very large depth of the absorber, if we find at least one nucleon of energy $> \varepsilon$ we may be sure that the energy of the nucleons lies in the infinitesimal energy range about $\varepsilon$, i.e. $(\varepsilon, \varepsilon + d\varepsilon)$.

The relation (5) or (21) states: for very large depth of the absorber, if we find at least one nucleon of energy $> \varepsilon$ we may be sure that there is only one nucleon.

Both relations taken together state: if we find, for very large depth of the absorber, at least one nucleon of energy $> \varepsilon$, we may be sure that there is only one nucleon, its energy lying in the infinitesimal energy range $(\varepsilon, \varepsilon + d\varepsilon)$.

Formulae (5) or (21) give asymptotic solutions for the distribution function. This result may be interesting from the point of view of the theoretical physicist. The experimental physicist, however, requires, among other things, numerical data for large but finite depth of the absorber. Our solution realizes the first step in an approximation procedure for large but finite $x$. It seems very probable that the next step in the approximation will be given by the second factorial moment, the one after that by the third, and so on. The second part of this paper is devoted to this problem.
The author wishes to express his sincere thanks to Professor J. RZEWUSKI for continued encouragement and helpful discussions and to Mgr. K. URBANIK and H. STACKOWIAK for helpful discussions.


APPENDIX


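The G-equation reads

$$
\frac{\partial G(\varepsilon, u, x)}{\partial x} + G(\varepsilon, u, x) = \int_0^1\!\!\int_0^1 w(\varepsilon', \varepsilon'')\, d\varepsilon'\, d\varepsilon''\; G\!\left(\frac{\varepsilon}{\varepsilon'}, u, x\right) G\!\left(\frac{\varepsilon}{\varepsilon''}, u, x\right).
\tag{1}
$$

The boundary conditions are

$$
G(\varepsilon, u, 0) = \begin{cases} u & \text{for } \varepsilon \le 1, \\ 1 & \text{for } \varepsilon > 1. \end{cases}
\tag{1.A}
$$

For $\varepsilon > 1$, (1) and (1.A) will of course be satisfied by $G(\varepsilon, u, x) = 1$.
For $\varepsilon > 1/2$ we introduce an auxiliary function

$$
\Gamma_2(\varepsilon, u, x) = 1 - G(\varepsilon, u, x).
$$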


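We express $G$ in (1) in terms of $\Gamma_2$, and find

$$
\frac{\partial \Gamma_2(\varepsilon, u, x)}{\partial x} + \Gamma_2(\varepsilon, u, x) = \int_0^1 d\varepsilon'\, w(\varepsilon')\, \Gamma_2\!\left(\frac{\varepsilon}{\varepsilon'}, u, x\right) - \int_0^1\!\!\int_0^1 d\varepsilon'\, d\varepsilon''\, w(\varepsilon', \varepsilon'')\, \Gamma_2\!\left(\frac{\varepsilon}{\varepsilon'}, u, x\right) \Gamma_2\!\left(\frac{\varepsilon}{\varepsilon''}, u, x\right).
\tag{2.A}
$$

If we take into account that

$$
w(\varepsilon', \varepsilon'') = 0 \quad \text{for } \varepsilon' + \varepsilon'' > 1
$$

and

$$
\Gamma_2(\varepsilon, u, x) = 0 \quad \text{for } \varepsilon > 1,
$$

then it becomes obvious that the last term vanishes. (2.A) reduces to

$$
\frac{\partial \Gamma_2(\varepsilon, u, x)}{\partial x} + \Gamma_2(\varepsilon, u, x) = \int_0^1 w(\varepsilon')\, d\varepsilon'\, \Gamma_2\!\left(\frac{\varepsilon}{\varepsilon'}, u, x\right).
\tag{3.A}
$$

But (3.A) has the form of the equation for the first moment (*); only the boundary conditions are different. To solve (3.A) we apply the well-known method of the Mellin transform and find

$$
\Gamma_2(\varepsilon, u, x) = (1 - u)\, S_1(\varepsilon, x).
\tag{4.A}
$$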

For $\varepsilon > 1/3$ we introduce

$$
\Gamma_3(\varepsilon, u, x) = 1 - G(\varepsilon, u, x)
$$

and apply the same procedure as in the case of the function $\Gamma_2(\varepsilon, u, x)$. We obtain the equation


1 
Sp ; 
9 OL (exits Qe se E 
(5.A) 5 ++ Ise, use) — ID (È, U J (ede 
Ou i € 
0 
aati 
€ td I iA I 
=—{{ Z|, 4, «|L,(—, u, @ | w(e','e") de de’. 
€ 
00 


Generally, for $\varepsilon > 1/n$, we get for

$$
\Gamma_n(\varepsilon, u, x) = 1 - G(\varepsilon, u, x)
$$

an equation analogous to (5.A), namely


: OL’, (€), ©) e 
(6.A) te; ne. + Le, U, o) i Di U, 7) w(e') de’ # 
Cx } (Aa 
0 
dii 
1 € E 
=== pei 3 U, A )La( a5 Ub. e) w(e', e') de’ ke! 
do 


4 
i 


We are able to solve (6.A) by means of the Mellin transform (4), provided we hold $\Gamma_{n-1}(\varepsilon, u, x)$ to be a known function. The solution of (6.A) is simultaneously the formal solution of (1). It has the form given by (2).


This Appendix is a summary of a paper submitted to the Acta Physica Polonica (*).


(4) H. MESSEL: Proc. Roy. Irish Acad., A 54, 125 (1951).
Evaluation of the Distribution Function.


J. LOPUSZANSKI 


Institute of Theoretical Physics of the Polish Academy of Science - Wroclaw 


(received 12 April 1955)


CONTENTS: 1. Introduction. — 2. The Behaviour of the Factorial Moments. — 3. The Second and Higher Approximations to the Probability Distribution Function. — 4. Conclusions.


1. — Introduction.


In a previous paper (LOPUSZANSKI (1), hereafter referred to as paper II) we have obtained asymptotic formulae for the probability distribution function for the nucleon cascade in homogeneous nuclear matter

$$
\begin{cases}
P(0\,|\,\varepsilon, x) \sim 1, \\
P(1\,|\,\varepsilon, x) \sim S_1(\varepsilon, x), \\
P(n\,|\,\varepsilon, x) = o\big(P(1\,|\,\varepsilon, x)\big) \quad \text{for } n > 1
\end{cases}
\tag{1}
$$

(for notation see paper II). One may consider (1) as the first approximation for the distribution function for large but finite $x$.
In paper II we have also shown that 


CAS lim [Le 
sis Iole) OF a) 


(1) J. LOPUSZANSKI: see this issue, pag. 1150. 


To investigate the next approximations we consider the probabilities of finding at least two, three, and so on, nucleons of energy $> \varepsilon$ at the depth $x$. It is well known that

$$
G(\varepsilon, u, x) = 1 + \sum_{k=1}^{\infty} (-1)^k\, \frac{1}{k!}\, S_k(\varepsilon, x)\, (1-u)^k ,
$$

where $S_k(\varepsilon, x) = \sum_{n=0}^{\infty} n(n-1)\cdots(n-k+1)\, P(n\,|\,\varepsilon, x)$ denotes the $k$-th factorial moment. Hence

$$
\Gamma(\varepsilon, u, x) = \sum_{k=1}^{\infty} (-1)^{k+1}\, \frac{1}{k!}\, S_k(\varepsilon, x)\, (1-u)^k .
\tag{3}
$$

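One may easily prove the following relation

$$
\frac{\Gamma(\varepsilon, u, x)}{1-u} = \big[1 - P(0\,|\,\varepsilon, x)\big] + \big[1 - P(0\,|\,\varepsilon, x) - P(1\,|\,\varepsilon, x)\big]\, u + \big[1 - P(0\,|\,\varepsilon, x) - P(1\,|\,\varepsilon, x) - P(2\,|\,\varepsilon, x)\big]\, u^2 + \dots
$$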
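The relation stated here is not proved in the text. A short sketch of the derivation follows, assuming (as in paper II) that $G(\varepsilon,u,x)=\sum_n u^n P(n\,|\,\varepsilon,x)$ is the generating function and that $\Gamma = 1 - G$; the shorthand $P_n$ for $P(n\,|\,\varepsilon,x)$ is introduced here only for brevity and is not the author's notation.

```latex
\begin{align*}
\Gamma(\varepsilon,u,x) &= 1 - \sum_{n=0}^{\infty} u^{n} P_n
  = \sum_{n=1}^{\infty} \bigl(1-u^{n}\bigr)\,P_n
  \qquad \Bigl(\text{since } \sum_{n=0}^{\infty} P_n = 1\Bigr) \\
 &= (1-u) \sum_{n=1}^{\infty} \bigl(1 + u + \dots + u^{n-1}\bigr)\, P_n
  = (1-u) \sum_{k=0}^{\infty} u^{k} \sum_{n=k+1}^{\infty} P_n , \\
\frac{\Gamma(\varepsilon,u,x)}{1-u} &= \sum_{k=0}^{\infty} u^{k}\,\Bigl[\,1 - \sum_{n=0}^{k} P_n \Bigr].
\end{align*}
```

The coefficient of $u^{k-1}$ is therefore the probability of finding at least $k$ nucleons of energy $> \varepsilon$ at depth $x$.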


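This formula enables us to write down at once the expressions for the probabilities to find at least $k$ ($k = 1, 2, \dots$) nucleons of energy $> \varepsilon$ at depth $x$:

$$
\sum_{n=k}^{\infty} P(n\,|\,\varepsilon, x) = \frac{1}{(k-1)!} \left[ \frac{\partial^{\,k-1}}{\partial u^{\,k-1}}\, \frac{\Gamma(\varepsilon, u, x)}{1-u} \right]_{u=0} = \sum_{l=k}^{\infty} (-1)^{l-k} \binom{l-1}{k-1} \frac{1}{l!}\, S_l(\varepsilon, x) .
\tag{4}
$$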

2. — The Behaviour of the Factorial Moments. 


In paper II we have presented a new integral equation equivalent to the
integral G-equation of JANOSSY (2)


(5) Ie,ua)=(1—u)S(e 7) —D(n,e,e) 
fae (28m 2) pfe e 
Jes È sor ju (- Wa o) He RO e). 
0 u 


If we substitute for the $\Gamma$'s on the right-hand side of (5) the expressions for $\Gamma$ given by (5), and repeat this procedure infinitely many times, we get a formal expansion of $\Gamma$ in powers of $(1-u)$. The coefficient of $(1-u)^2$ reads


a 


abe! € x , 


(2) L. JANOSSY: Proc. Phys. Soc., A 63, 241 (1950).
and generally, the coefficient of $(1-u)^n$


® 


È DISP CR 
(7) - Sn(6:2) = D(n, &', #)f de|— ea send si 
ni \ on | 


0 


We show now the important relation

$$
\lim_{x\to\infty} \frac{S_n(\varepsilon, x)}{S_m(\varepsilon, x)} = 0 \quad \text{for } n > m \text{ and } \varepsilon > 0 .
\tag{8}
$$


To this aim let us prove the auxiliary lemma, that

$$
\lim_{x\to\infty} \frac{-\partial S_1(\omega, x)/\partial\omega}{S_2(\omega', x)} = 0 \quad \text{for } \omega > 2\omega' .
\tag{9}
$$


To prove formula (9) we take advantage of the fact that we may find a sufficiently large, finite $X(\omega')$ such that for $x > X$ the function $-\partial S_1(\eta, x)/\partial\eta$ is a non-increasing function for all $\eta > 2\omega' > 0$. Consequently we have


X-X(®') 


Ita AS i ee setae 3 
5 Ss) = D(x, €, a | as (— so = dI (>, i Si ee , E) + 
of 81 (Mm @—8)) (È dm e a. 
| ae — ce i S, (3 deu (È mae Pm ese ( a 
RK) a-z0) RE 
[walt at) 
0 
Hence 


[pe } > Din, e e") lim fO81(9,.2)/02 | | 
(10) me ie SN (co, 2)/dwf 7 nye) € pane 108, (0, 2)/20f 


(co) 
f 


Jases(e: shale 8) 


It was shown in paper II that (2) yields 


if (0,0) _ 
DALE ASSI Oaths hen 
Hence


08, (0, w)/0y | 
area RI DI 


is different from zero and singular. Consequently we have 


11) “ne OS1(, « #)/00) |, NI Sui 0,0) | _ ; foSi(a, + i 0 = 
gi Um, 0,2) Jase |=- 08,7; 2)/enl 20 | 08,(m, #)/ er Dine 
TOES = 2H (>): 

The integral on the right-hand side of (10) is positive because the spinti 
functions are positive and continuous. Due to the properties of $D(\eta, \varepsilon', \varepsilon'')$
(cf. paper II) $\eta$ in (10) takes on values from the interval $(2\omega', 1)$. If in this
interval a small but finite subinterval exists for which $\eta < \omega$ then, by virtue
of (11), the right-hand side of (10) tends to infinity. In this case we have
$2\omega' < \omega$ and the result follows.

The formula (9) can be generalized to

$$
\lim_{x\to\infty} \frac{-\partial S_1(\omega, x)/\partial\omega}{S_n(\omega', x)} = 0 \quad \text{for } \omega > n\omega' .
\tag{12}
$$

The proof is similar to that of (9).
Starting from (9) we can deduce the following relations 


(13) lim [Sule 2) les == fore > 


EGR x)f 


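Let us prove this relation. It is well known that $S_n(\varepsilon, x) = 0$ for $\varepsilon > 1/n$. This becomes at once evident from (6) and (7) if we take into account that $S_1(\varepsilon, x) = 0$ for $\varepsilon > 1$. We have for the integration intervals of the variables appearing in (6) and (7) the relations

$$
1 > \eta > \varepsilon' + \varepsilon'' > 2\varepsilon \quad \text{and} \quad 1 > \eta > \varepsilon' + \varepsilon'' > n\varepsilon \quad \text{resp.}
$$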


It was shown (URBANIK (4), hereafter referred to as paper I) that for 
Veg 


: il 
(14a) — Silea Wien) lie dz È 


(*) Our derivation of (11) was not mathematically rigorous. To proceed more rigorously we must first prove that the operators $\partial/\partial\omega$ and $\lim_{x\to\infty}$ commute. To show this we proceed in a manner similar to that indicated in paper II for analogous cases.


(4) K. URBANIK: see this issue, pag. 1147. 
where
(140) lim f{e-6W,(e,&@)} = 0 tor, ded 
eee 
and 
(14e) lim W,(e,%) = co. 


Using (14) we can write 


(15) e (1/n!) S,(€, x) het | _ OS, (7, oe: 
Egg So Sle, 2) 


a 


n-1 


N ] € 
{cae wis, ale ee Bary (5, sles 
J a oi ba — k)! Vac is gi 


The integral on the right-hand side of (15) is bounded, because of (14), and
positive for all $\eta$. The right-hand side of (15) vanishes for $x \to \infty$ if


(16) lim 


and this takes place for $\eta > 2\varepsilon$, according to (9). The relation (16) must hold
for all $\eta$ from the interval $(n\varepsilon, 1)$; consequently it is necessary that


WE Z 2E 
and the result (13) follows. 


We may easily generalize the result (13) to (8):

$$
\lim_{x\to\infty} \frac{S_n(\varepsilon, x)}{S_m(\varepsilon, x)} = 0 \quad \text{for } n > m \text{ and } \varepsilon > 0 .
$$

That is our main result. It may be justified in a manner similar to (13), making use of (12).


3. — The Second and Higher Approximations of the Probability Distribution 
Function. 


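Formula (8) in fact contains the solution of the problem of evaluating the higher approximations. Indeed, we have, according to (4),

$$
\sum_{l=2}^{\infty} P(l\,|\,\varepsilon, x) = \frac{1}{2!}\, S_2(\varepsilon, x) - \frac{2}{3!}\, S_3(\varepsilon, x) + \frac{3}{4!}\, S_4(\varepsilon, x) - \dots
\tag{17}
$$

Let us divide both sides of (17) by $S_2(\varepsilon, x)$ and let $x$ tend to infinity. Taking (8) into account, we get

$$
\lim_{x\to\infty} \frac{\sum_{l=2}^{\infty} P(l\,|\,\varepsilon, x)}{S_2(\varepsilon, x)} = \frac{1}{2} .
\tag{18}
$$

We notice that the number of terms on the right-hand side of (17) does not exceed $[1/\varepsilon]$. It follows immediately from (18) that

$$
\begin{cases}
P(2\,|\,\varepsilon, x) \sim \tfrac{1}{2}\, S_2(\varepsilon, x), \\
P(n\,|\,\varepsilon, x) = o\big(P(2\,|\,\varepsilon, x)\big) \quad \text{for } n \ge 3 .
\end{cases}
\tag{19}
$$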

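In the same manner we obtain generally from (4)

$$
P(n\,|\,\varepsilon, x) \sim \frac{1}{n!}\, S_n(\varepsilon, x),
\tag{20}
$$

$$
P(n\,|\,\varepsilon, x) = o\big(P(m\,|\,\varepsilon, x)\big)
\tag{21}
$$

for $n > m$; $n, m = 1, 2, \dots$ and $\varepsilon > 0$.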


4. — Conclusions.


Formulae (20) together with (8) constitute the main result of our note.

It becomes obvious from (3) and (20) that for large $x$ the successive terms of the expansion of $\Gamma(\varepsilon, u, x)$ represent the successive approximation steps.

Equation (3) yields


To obtain the expansion of $\Gamma(\varepsilon, u, x)$ in terms of $S_n(\varepsilon, x)$ it is convenient to use either formula (5) or (7). The series (22) are so-called Leibniz series. Taking (8) into account, the ratio of the $(n+1)$-th to the $n$-th term tends to zero if $x \to \infty$; consequently a fairly good convergence for large but finite $x$ may be expected. The fact that (22) are Leibniz series admits at once an estimation of the series. We have e.g. for the function $\Gamma(\varepsilon, 0, x)$, which is of great importance in practical work, from (3) successively


$$
S_1 \ge \Gamma(\varepsilon, 0, x) \ge S_1 - \tfrac{1}{2}\, S_2 ,
\tag{23a}
$$

$$
S_1 - \tfrac{1}{2}\, S_2 + \tfrac{1}{6}\, S_3 \ge \Gamma(\varepsilon, 0, x) \ge S_1 - \tfrac{1}{2}\, S_2 + \tfrac{1}{6}\, S_3 - \tfrac{1}{24}\, S_4 .
\tag{23b}
$$
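The bracketing of $\Gamma(\varepsilon, 0, x) = 1 - P(0\,|\,\varepsilon, x)$ by successive partial sums of the alternating series in $S_k/k!$ can be illustrated numerically. The following is a minimal sketch under stated assumptions: the distribution `p` is invented for the example, the helper names are hypothetical, and the bounds are read as $S_1 \ge \Gamma \ge S_1 - \tfrac12 S_2$ and so on.

```python
from math import factorial


def binomial_moments(p):
    """Return [S_k / k!] for k = 1 .. len(p) - 1 (S_k = k-th factorial moment)."""
    out = []
    for k in range(1, len(p)):
        s_k = 0.0
        for n, pn in enumerate(p):
            falling = 1
            for j in range(k):
                falling *= (n - j)
            s_k += falling * pn
        out.append(s_k / factorial(k))
    return out


def partial_sums(p):
    """Partial sums of S_1 - S_2/2! + S_3/3! - ... (one entry per added term)."""
    sums, acc = [], 0.0
    for i, b in enumerate(binomial_moments(p)):
        acc += b if i % 2 == 0 else -b
        sums.append(acc)
    return sums


# Hypothetical distribution P(0), ..., P(4); not data from the paper.
p = [0.90, 0.07, 0.02, 0.008, 0.002]
gamma0 = 1.0 - p[0]          # probability of at least one nucleon
sums = partial_sums(p)

for i, s in enumerate(sums, start=1):
    side = "upper" if i % 2 == 1 else "lower"
    print(f"{i} term(s): {s:.6f} ({side} bound), exact {gamma0:.6f}")
```

Partial sums with an odd number of terms lie above the exact value and those with an even number lie below it, so two consecutive sums enclose $\Gamma(\varepsilon, 0, x)$; the enclosure is useful only where the moments are small, i.e. at large depth.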
The first and second moments were exhaustively investigated by many authors
(for the nucleon cascade in homogeneous nuclear matter see e.g. (5,6)). Higher
moments have also been considered in some papers (see e.g. (7)). The results
of these investigations enable us to estimate the distribution function according
to (23). One may ask for which $x$ the approximation (23a), and for
which $x$ (23b) or a better approximation, is the most suitable. We are not able to answer
this question, but at any rate (23a) certainly does not fit for that $x$-interval
where S,(e, 2) > 1. The same is also true mutatis mutandis for (230). 

Further, the question may arise why the customary approximation method
of Picard was not applied to the integral G-equation. This equation does admit
the application of this method, in accordance with the « fixed-point theorem ».
Although this method seems to be a straightforward one, it is not
suitable for large $x$, because the successive approximation steps consist of sums
of graph representations of all possible cascade shapes (trees). It is, however,
quite possible that, if we chose as the starting point for Picard's approximation
method the equation (5) instead of the G-equation and as the first
approximation —S,, we get our results. 

We hope that our method will be useful for practical purposes, especially 
for numerical computations of the distribution function for large depth of 
the absorber. 

The considerations given above can be generalized to the case of the nucleon
cascade in a finite absorber as well as to the case of the electron-photon cascade.


The author wishes to express his sincere thanks to Professor J. RZEWUSKI
for continued encouragement and helpful discussions and to Mgr. K. URBANIK
for helpful discussions.


(5) H. MESSEL: Proc. Roy. Irish Acad., A 54, 125 (1951).
(6) H. MESSEL: Proc. Phys. Soc., A 64, 726 (1951).
(7) H. MESSEL: Proc. Phys. Soc., A 65, 465 (1952).
and Thermomagnetic Phenomena (*).


R. FIESCHI 


Gruppo di Fisica dei Solidi, Istituto Nazionale di Fisica Nucleare - Sezione di Milano 


(received 26 May 1955)


In a previous paper (1), hereafter denoted as (I), the equations describing the galvanomagnetic and thermomagnetic phenomena in anisotropic crystals have been derived from more general macroscopic equations for transfer phenomena in the presence of a magnetic field. The equations which have been obtained (cf. I, eqs. (II.138, 139) or (II.142, 143)) are slightly different from those used by other authors (2,3), as the gradient of the difference between the electrochemical potentials of electrons and ions, $\tilde\mu_e - \tilde\mu_i$, appears instead of the gradient of $\tilde\mu_e$ alone (4). It is however easy to show that the two sets of equations are equivalent.

The equations of I are written with respect to the center of mass of the system, i.e. the entropy flow $\mathbf{J}_s$ and the electronic flow $\mathbf{J}_e$ are defined with respect to the center of mass. If we remember that we are dealing with systems consisting of a rigid ion lattice and of electrons, and that the electronic mass is negligible in comparison with the ionic mass, we see that no physical difference is introduced in writing equations in which the flows are defined with respect to the rigid lattice. The new equations obtained with this change


(*) Appendix to a previous paper (see reference (1) below).

(1) R. FIESCHI: Suppl. Nuovo Cimento, 1, 1 (1955).

(2) S. R. DE GROOT: Thermodynamics of Irreversible Processes (Amsterdam and New York, 1951).

(3) H. B. CALLEN: Phys. Rev., 73, 1349 (1948); 85, 16 (1952); R. FIESCHI, S. R. DE GROOT and P. MAZUR: Physica, 20, 259 (1954).

(4) The symbols we use here have been introduced and defined in I.
and the corresponding change of the conjugated thermodynamical « forces » are written in the more usual form.

Let us write the entropy flow $\mathbf{J}'_s$ and the electronic flow $\mathbf{J}'_e$ with respect to the lattice:

$$
\mathbf{J}'_s = \mathbf{J}_s + s\rho\,(\mathbf{v} - \mathbf{v}_i),
\tag{1}
$$

$$
\mathbf{J}'_e = \mathbf{J}_e + \rho_e\,(\mathbf{v} - \mathbf{v}_i) = \frac{\rho}{\rho_i}\, \mathbf{J}_e ,
\tag{2}
$$

where use has been made of the relations $\mathbf{J}_e = -\mathbf{J}_i = -\rho_i(\mathbf{v}_i - \mathbf{v})$ and $\rho = \rho_i + \rho_e$. $\mathbf{v}$ is the barycentric velocity, $\mathbf{v}_i$ the velocity of the ionic lattice, $\rho$ the total density, $\rho_e$ and $\rho_i$ the densities of electrons and of ions, and $s$ the entropy per unit mass.

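As is well known, the « forces » $\mathbf{X}'_k$ conjugated with these new fluxes can be found as linear combinations of the former forces $\mathbf{X}_k$, by noting that the expression for the entropy production $\sigma$ is invariant under the transformation from the former to the new set of thermodynamical quantities:

$$
-T\sigma = \mathbf{J}_s\cdot\mathbf{X}_s + \mathbf{J}_e\cdot\mathbf{X}_e = \mathbf{J}'_s\cdot\mathbf{X}'_s + \mathbf{J}'_e\cdot\mathbf{X}'_e ,
\tag{3}
$$

hence

$$
\mathbf{J}_s\cdot\mathbf{X}_s + \mathbf{J}_e\cdot\mathbf{X}_e = \mathbf{J}_s\cdot\mathbf{X}'_s + \mathbf{J}_e\cdot\left(\frac{\rho}{\rho_i}\,\mathbf{X}'_e + \frac{s\rho}{\rho_i}\,\mathbf{X}'_s\right),
\tag{4}
$$

where the expressions (1) and (2) for the new fluxes have been used. Relation (4) is satisfied if we put

$$
\mathbf{X}'_s = \mathbf{X}_s
\tag{5}
$$

and

$$
\mathbf{X}'_e = \frac{\rho_i}{\rho}\,\mathbf{X}_e - s\,\mathbf{X}_s .
\tag{6}
$$

By substituting in (6) the explicit expressions for $\mathbf{X}_s$ and $\mathbf{X}_e$, namely

$$
\mathbf{X}_s = \operatorname{grad} T ,
\tag{7}
$$

$$
\mathbf{X}_e = \operatorname{grad}\,(\tilde\mu_e - \tilde\mu_i) ,
\tag{8}
$$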


(the time derivative of the vector potential $\mathbf{A}$ has been neglected), one has

$$
\mathbf{X}'_e = -s \operatorname{grad} T + \frac{\rho_i}{\rho} \operatorname{grad}\,(\tilde\mu_e - \tilde\mu_i) = -s \operatorname{grad} T + \frac{\rho_i}{\rho}\,\big[(e_e - e_i) \operatorname{grad}\varphi + \operatorname{grad}\mu_e - \operatorname{grad}\mu_i\big] ,
\tag{9}
$$

where the definition of electrochemical potential has also been used: $\tilde\mu_k = \mu_k + e_k\varphi$, $\mu_k$ being the chemical potential of component $k$, $e_k$ the charge density per unit mass and $\varphi$ the electric potential. The Gibbs-Duhem relation, when pressure, as in our case, is uniform, gives

$$
-\operatorname{grad}\mu_i = \frac{\rho_e}{\rho_i} \operatorname{grad}\mu_e + \frac{\rho s}{\rho_i} \operatorname{grad} T .
\tag{10}
$$

By substituting this expression for $\operatorname{grad}\mu_i$ in (9) we have

$$
\mathbf{X}'_e = \frac{\rho_i}{\rho}\,(e_e - e_i) \operatorname{grad}\varphi + \operatorname{grad}\mu_e
\tag{11}
$$

and finally, as $\rho_i \simeq \rho$ and $e_e \gg e_i$ (the charge density per unit mass of electrons is much larger than the charge density per unit mass of ions), and putting $e = e_e$,

$$
\mathbf{X}'_e = e \operatorname{grad}\varphi + \operatorname{grad}\mu_e .
\tag{12}
$$
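The entropy production can now be written

$$
\sigma = -\frac{1}{T}\left(\mathbf{J}'_s\cdot\operatorname{grad} T + \frac{1}{e}\,\mathbf{i}\cdot\operatorname{grad}\tilde\mu_e\right),
\tag{13}
$$

where the electric current density $\mathbf{i} = e\,\mathbf{J}'_e$ has been introduced. Assuming, according to the scheme of the Onsager theory, linear relations between the « fluxes » and « forces » which appear in (13), we have the phenomenological equations in the desired form:

$$
\mathbf{J}'_s = -L_{11}\cdot\operatorname{grad} T - L_{12}\cdot\frac{1}{e}\operatorname{grad}\tilde\mu_e ,
\tag{14}
$$

$$
\mathbf{i} = -L_{21}\cdot\operatorname{grad} T - L_{22}\cdot\frac{1}{e}\operatorname{grad}\tilde\mu_e .
\tag{15}
$$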

Among the phenomenological coefficients of equations (14) and (15) the reciprocal relations which have been proved in I still hold, since the validity of these relations is conserved by linear transformations of fluxes and forces which leave the entropy production invariant (2).
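The two statements used in this note, that the new fluxes and forces leave the bilinear form $\mathbf{J}\cdot\mathbf{X}$ unchanged and that such a transformation preserves the symmetry of the phenomenological coefficients, can be checked numerically. The sketch below is a simplified illustration, not the author's calculation: it treats one scalar component of each flux and force (the note itself deals with tensors in anisotropic crystals), reads the transformation as $J'_s = J_s + (s\rho/\rho_i) J_e$, $J'_e = (\rho/\rho_i) J_e$, $X'_s = X_s$, $X'_e = (\rho_i/\rho) X_e - s X_s$, and uses invented numerical values and hypothetical function names.

```python
def transform(j_s, j_e, x_s, x_e, s, rho, rho_i):
    """Lattice-frame fluxes and forces from the barycentric ones (scalar sketch)."""
    j_s_new = j_s + s * rho / rho_i * j_e   # entropy flow, using v - v_i = J_e / rho_i
    j_e_new = rho / rho_i * j_e             # electronic flow
    x_s_new = x_s                           # force conjugate to the entropy flow
    x_e_new = rho_i / rho * x_e - s * x_s   # force conjugate to the electronic flow
    return j_s_new, j_e_new, x_s_new, x_e_new


def transform_coefficients(L, s, rho, rho_i):
    """Return L' = A L A^T, where J' = A J is the flux transformation."""
    a = [[1.0, s * rho / rho_i], [0.0, rho / rho_i]]
    al = [[sum(a[i][k] * L[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    return [[sum(al[i][k] * a[j][k] for k in range(2)) for j in range(2)]
            for i in range(2)]


# Hypothetical values in arbitrary units; not taken from the paper.
s, rho, rho_i = 0.3, 1.0, 0.999
L = [[2.0, 0.5], [0.5, 1.5]]                # symmetric coefficient matrix
x_s, x_e = 0.7, -1.2
j_s = L[0][0] * x_s + L[0][1] * x_e
j_e = L[1][0] * x_s + L[1][1] * x_e

j_s2, j_e2, x_s2, x_e2 = transform(j_s, j_e, x_s, x_e, s, rho, rho_i)
L2 = transform_coefficients(L, s, rho, rho_i)

print("old bilinear form:", j_s * x_s + j_e * x_e)
print("new bilinear form:", j_s2 * x_s2 + j_e2 * x_e2)
print("new coefficients :", L2)
```

Because the forces transform with the inverse transpose of the flux matrix, the new coefficient matrix is $A L A^{T}$, which is symmetric whenever $L$ is; this is the algebraic content of the remark that the reciprocal relations survive the change of reference frame.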


Thanks are due to Dr. P. MAZUR for useful discussions.
NUOVO CIMENTO


A CURA DELLA SOCIETA ITALIANA DI FISICA 


955 ¢ ; 
LUG 20 Semestre 


The Language and Style of Galileo (*).


SB ORIO 


Istituto di Glottologia dell’ Università - Pisa 


(received 14 October 1955)


One of the judgments most commonly repeated about GALILEO as a writer, up to the time of Crocean criticism, is that he was the creator of Italian scientific prose. It is obvious that GALILEO is the creator only of his own prose (1) and that, although his example was of incalculable value in founding a tradition, the use of the vernacular was part of that scientific revolution from which modern inquiry was born, and was, so to speak, suggested by the very circumstances of teaching and of polemic.

The immense importance of GALILEO's discoveries and the need to communicate the fruits of his observations and meditations to a very wide circle of pupils and scholars are reasons sufficient to explain the use of the vernacular instead of the traditional Latin. There is, however, another reason as well, which must be sought in the innermost spirit and formation of GALILEO as man and writer: the need to make every aspect of the research clear to himself and to others, the necessity of leaving no area of the inquiry in shadow and of making dialogue possible for everyone, almost an imperious need to measure against others' capacity for reasoning and understanding what his own mind had suggested to him. On 16 August 1636 (XVI, p. 475 (*)) he wrote to FULGENZIO MICANZIO:


(*) Lecture given in Pisa at the Domus Galileana on 10 May 1954.
(1) See U. Bosco: Galileo scrittore, in La Cultura, pp. 110-118 (1932).


«Non mancherò di far provisione di tutto il resto delle mie opere 
per mandargliele [a L. ELZEVIR] e, se sarà possibile, tutte latine: se ben 
per ver dire, dove oltre alle serrate dimostrazioni pure matematiche 
entrano discorsi, nel trasportar l’opere dalla lingua del loro autore in 
un’altra, si perde assai di grazia, e forse di energia e anco di chiarezza ». 


GALILEO does not seem ever to have dwelt much on the consideration that Latin was required by the public of foreign scholars, as ROBERTO GALILEI pointed out to him from Lyons (XVI, p. 72) with regard to the Nuove Scienze; on the contrary, openly and without hesitation, he himself gives the reason for his writing in the vernacular (XI, p. 335):


«Mi è convenuto scriver questo Discorso [è il Discorso intorno alle
cose che stanno in su l'acqua, del 1612] in lingua italiana, acciò possa esser
inteso, almeno in parte, da tutta la città, perchè così ha portato l'occasione
di certa disputa ».


But there is more. As MIGLIORINI (2) has pointed out, it seems that GALILEO, if only for a moment, hoped that Italian might become the international language of science. In his third letter on sunspots he says that he wrote in Italian also so that MARCO WELSER, a German from Augsburg, formerly a student at Padua and well acquainted with our language, might reply in Italian (V, p. 189):


« Dispiacemi ancora della difficoltà che apporta ad Apelle l’aver io 
scritto nella nostra favella fiorentina: il che ho fatto per diversi rispetti, 
uno dei quali è il volere in certo modo abusare la ricchezza e perfezion 
di tal lingua, bastevole a trattare e spiegar e’ concetti di tutte le facol- 
tadi; e però dalle nostre Accademie e da tutta la città vien gradito lo 
scrivere più in questo che in altro idioma. Ma in oltre ci ho auto un altro 
mio particolar interesse, ed è il non privarmi delle risposte di V. S. in 
tal lingua, vedute da me e da gli amici miei con molto maggior diletto
e meraviglia che se fossero scritte del più purgato stile latino; e parci 
nel leggere lettere di locuzione tanto propria, che Firenze estenda i suoi 
confini, anzi il recinto delle sue mura, sino in Augusta ». 


At first GALILEO had written «sino in Germania» ("as far as Germany"), then corrected it out of an innate sense of prudence and measure. Such hopes on GALILEO's part, even though barely hinted at, had some foundation. This was 1612 and, even though some attempts had been made by foreign scholars to abandon Latin (by BACON in 1605, when Of Proficience and Advancement of Learning came out in English, only to be promptly rewritten in Latin under the title De dignitate et augmentis scientiarum, or, earlier still, by KEPLER who, in 1599, published an almanac in German to make reading easier for nobles and prelates and, correspondingly, to secure for himself a suitable increase in salary), the game was still wide open. Only in 1637 did DESCARTES bring out the Discours de la méthode, in which we read:


(*) I quote from the volumes of the Edizione nazionale of the Opere of GALILEO.
(2) In Galileo e la lingua italiana. To this work, published in the volume Lingua e cultura (Rome, 1948), pp. 135-158, we owe many suggestions.

«Si j'écris en français, qui est la langue de mon pays, plutôt qu'en
latin, qui est celle de mes précepteurs, c'est à cause que j'espère que
ceux qui ne se servent que de leur raison naturelle toute pure jugeront
mieux de mes opinions que ceux qui ne croient qu'aux livres anciens. Et
pour ceux qui joignent le bon sens avec l'étude, lesquels seuls je souhaite
pour mes juges, ils ne seront point, je m'assure, si partiaux pour le
latin, qu'ils refusent d'entendre mes raisons, pour ce que je les
explique en langue vulgaire ».


Needs similar to GALILEO's are thus voiced at least twenty-five years later by DESCARTES, who nevertheless continued afterwards to write in Latin and then translate into French.

After 1610, that is after the Sidereus Nuncius, GALILEO remained faithful to the vernacular, and even though in his works we find no further hint of any possibility of seeing Italian become the universal language of science, his persistence in the vernacular is still a significant sign of a secret hope.

The point on which GALILEO insists is the necessity that everyone should personally take stock of the matters discussed and abandon dogmatism in science. In a most delightful letter of 16 June 1612 (XI, p. 327) addressed to GUALDO, he writes:


«Ho ricevuto dal S. Velsero avviso come la mia gl’è pervenuta e che 
gl’è stata grata; ma che Apelle per hora non potrà vederla, per non in- 
tender la lingua. Io ho scritta vulgare perché ho bisogno che ogni per- 
sona la possi leggere, e per questo medesimo rispetto ho scritto nel mede- 
simo idioma questo ultimo mio trattatello [il Discorso sulle Galleggianti |; 
e la ragione che mi muove, è il vedere, che mandandosi per gli Studii 
indifferentemente i gioveni per farsi medici, filosofi, ete., sì come molti si 
applicano a tali professioni essendovi inettissimi, così altri, che sariano atti, 
restano occupati o nelle cure familiari o in altre occupazioni aliene dalla 
litteratura, li quali poi, benché, come dice Ruzzante, forniti d’un bon 
snaturale, tutta via, non potendo vedere le cose scritte in baos, si VE 
persuadendo che in que’ slibrazzon ghe suppie de gran ogy de Were : 
de filuorica, e conse purassé che strapasse in elto purassé; et io voglio ch'e 


ni 


vegghino che la natura, sì come gl’ha dato gl’occhi per veder l’opere 
sue così bene come a i filuorichi, gli ha anco dato il cervello da poterle 
intendere e capire. Contutto ciò vorrei che anco Apelle e gl’altri oltra- 
montani potessero vederla; e qui, per esser io occupatissimo, haverei 
bisogno del favore di V. S. e del S. Sandeli, il quale mi facesse grazia di 
trasferirla quanto prima in latino e mandarmela poi subito, perché in 
Roma è chi si è preso cura di farla stampare insieme con alcune altre mie ». 
That, in truth, the sciences had not been treated in the vernacular in Italy before GALILEO cannot be maintained. LUCA PACIOLI and TARTAGLIA would suffice to show the contrary; there is no doubt, however, that neither of them was appreciated, although the former was a Tuscan. In this respect they had remained isolated figures. Against them stood the scholastic tradition and official science with all their weight. Mathematics, in so far as it retained something philosophical and was not limited to mere computation, still remained within the sphere of the learned tradition.

The revolution (for such it truly is) of the use of the vernacular could have been anticipated by at least a century if the works of LEONARDO had been systematic and had become known to a wide public.

In the life of GALILEO written by NICCOLÒ GHERARDINI we read (XIX, p. 642):


«Questo gran personaggio [GUSTAVO re di Svezia], intrattenendosi per 
alcuni mesi in Padova, si trovò quasi sempre presente alle lezioni del 
Sr. Galileo, il quale a richiesta di quei gran Signori fu persuaso a leggere 
in lingua toscana; conciossiacosaché essendo tutti questi assai ben instrutti 
nella latina favella et in qualche parte dirozzati nell’italiana, desidera- 
vano nell’istesso tempo ch’imparavano le scienze et acquistarne d’essa 
la perfezzione: e da indi in poi di rado e quasi mai fu udito il S. Galileo 
con altra lingua fuori che con la natia, etiamdio nella pubblica cattedra, 
favellare; il che diede materia ad alcuni suoi poco amorevoli di tacciarlo 
come di poco pratico nell’idioma latino: ma la verità è che ciò facea per 
compiacere alla voglia degli scolari, la maggior parte oltramontani, e 
per mettere in reputazione il parlar toscano, con adattare acconciamente 
i termini d’esso alle conclusioni di filosofia e di mathematica, senza diman- 
dargli in presto o mendicargli da altra lingua che non fosse la propria, 
contro l’oppenione dei più, quali per addietro ciò stimavano inconve- 
nevole, anzi impossibile ». 


In the annotations of VIVIANI, however, we read, in correction of GHERARDINI:


«Non è vero che leggesse in toscano in pubblico, ma al più qualche 
lezzione straordinaria »; 
and the information is repeated in the Racconto istorico della vita di Galileo by the same VIVIANI (XIX, p. 629):


« E volle [re GustAvo] nell’istessa casa di lui (con l'interesse d’eserci- 
tarsi insieme nelle vaghezze della lingua toscana) sentire l’esplicazione 
della sfera, le fortificazioni, la prospettiva e l’uso di alcuni strumenti 
geometrici e militari, con applicazione et assiduità di vero discepolo, 
discoprendogli in fine con amplissimi doni quella regia maestà ch'egli 
s'era proposto di occultare ». 


According to VIVIANI, then, GALILEO did not lecture publicly in the vernacular: such was the power, even over him, of the academic traditions, from which, as MIGLIORINI has pointed out, FRANCESCO VERINO had exceptionally broken away in the first half of the sixteenth century. In the colourful words of GELLI (4th Ragionamento of the Capricci del Bottaio), VERINO, «leggendo [nello studio] filosofia e veggendo talvolta venire a udirlo il capitano Pepe, il quale non intendeva la lingua latina, subito cominciava a leggere in vulgare perché e' potesse intendere anch'egli. E di poi, poco innanzi che egli si morisse [1543], per dimostrare la inestimabile bontà sua, leggendo pubblicamente ne lo Studio fiorentino il duodecimo libro de la divina Filosofia d'Aristotele, volse esporlo in vulgare acciocché ogni qualità di uomo lo potesse intendere; affermando insieme con Paulo Apostolo di esser così debitore a gli indotti come ai dotti ».

These words are addressed by the soul to Giusto the cooper, who objects:


«Ma coteste cose di filosofia possonsi elleno dire in vulgare? ». 
And the soul:


« O perché no? non è la lingua vulgare così ben atta a manifestare 
i concetti suoi come la latina, e l’altre che son tenute belle e buone? ». 


E, ai rinnovati dubbi di Giusto, che si era appellato ai dotti: 


«Giusto, questa è una di quelle cose che la fa dir loro la invidia; ma 
ei non ci andrà molto tempo, mercé del nostro illustrissimo Duca, che, 
seguitando di esaltarla, come egli ha incominciato, vi saranno levati 
questi vetri gialli da gli occhi, che vi fanno vedere ogni cosa giallo ». 


La profezia doveva realizzarsi; solo, invece che del Duca, il merito sarebbe 
stato di GALILEO. 

Il Sidereus Nuncius, del 1610, segna l’addio di GALILEO al latino, ma precedenti
volgari vi erano già stati con La bilancetta del 1586 e con le prime lettere
del 1588 in cui lo stile appare maturo e sia pure meno sciolto che nelle
opere maggiori. Anche molte minute furono stese in volgare per lezioni tenute
in latino.


Fra le opere latine di GALILEO occorre, naturalmente, distinguere ciò che è
compendio di lezioni altrui, come per esempio i riassunti delle lezioni di un 
maestro pisano sul De coelo di ARISTOTELE (I, p. 24) e le opere originali, i 
Theoremata (1585), il De motu (1590), il Sidereus Nuncius (1610). 

Nei riassunti i valori di stile sono minimi e le qualità usuali ed impersonali; 
nelle opere originali, il fervore della scoperta, il lievito di pensiero, l’evolversi 
dell’indagine danno un accento particolare alla lingua che assume un aspetto 
personale nell'impasto dei volgarismi e delle forme del parlato, pur nella solen- 
nità del periodare latino (4). Come è da aspettarsi, negli scritti latini vi sono 
elementi volgari e nelle opere volgari del primo periodo fino a La bilancetta 
vi sono dei giri di frase latineggianti. Vediamo qualche esempio. Nei Theore- 
mata, si legge (I, p. 30): 


«Quaeritur, 4°, an mundus sit perfectus. Videtur non esse: tum quia
in illo reperiuntur monstra et defectus, tum quia non semper habet 
suas species actu existentes, tum quia illi possunt addi multae species 
perfectiores; quae omnia imperfectionem declarant. Adverte, pro solu- 
tione, aliquid posse dici perfectum dupliciter: vel simpliciter, vel secundum 
quid. Perfectum simpliciter, dicitur illud cui nihil addi potest in perfectione;
perfectum secundum quid, illud est quod in suo quidem genere
est perfectum, non tamen habet perfectionem omnem possibilem ». 


È lo stile solito dei compendi, degli studi scolastici con una, per così dire, 
nervosa sollecitazione volgare. Il contenuto è ancora tradizionale. 

Diversamente nel De motu, in cui GALILEO già si allontana dalla dottrina 
aristotelica ed esprime ormai una materia in cui è impegnato direttamente 
(I, p. 334):


«Oh, quam facile ex veris principiis verae extrahuntur demonstra- 
tiones! Si verum est, quod diximus, mobilia, dum a statu quietis rece- 
dunt, cum tanta contraria vi impressa recedere, quanta est eorum gra- 
vitas; ergo quae graviora fuerint, cum maiori virtute contraria coniuncta 
recedent: quod si gravioribus plus de virtute ad contrarias partes impel- 
lente est absumendum quam levioribus, graviora certe ut tardius mo- 
veantur necesse erit, cum maiorem patiantur resistentiam. Et si, rursus, 
haec vera sunt, sequitur ut graviora, postquam tantum de resistentia 
contraria absumpserint ut non amplius tanto impediantur quanto leviora, 
ocius descendere debeant: quod certe rursus experientia monstrat ». 


(4) Si vedano le belle pagine di R. SPONGANO in La prosa di Galileo e altri scritti,
Messina-Firenze, 1949, p. 93 e seg.

L’impegno, che abbiamo visto espresso con quell’esclamazione all’inizio del
periodo or ora citato, si acuisce quando la polemica diventa diretta (I, p. 302):


« Aristotelem parum in geometria fuisse versatum, multis in locis 
suae philosophiae apparet; sed in hoc potissimum, ubi asserit, mo- 
tum circularem motui recto non esse proportionatum, quia, scilicet, 
recta linea curvae non est proportionata aut comparabilis: quod quidem 
mendacium (indignum enim est nomine opinionis), nedum intima et 
magis recondita geometriae inventa, Aristotelem ignorasse, verum et 
minima etiam principia huius scientiae, demonstrat. Nanque, quomodo 
dixit rectam et circularem non essere proportionatas, si ad hoc, ut quan- 
titates inter se proportionem habeant, sufficit ut minor possit toties 
multiplicari ut alteram excedat? An forsan chorda arcus, quae arcu minor 
est, in se saepius multiplicata, arcus longitudinem non excedet? At si 
excedet, cur dicitur ab Aristotele, arcum et chordam non esse propor- 
tionata? ». 


Questo mosso discorso che spesso ricorre ad interrogazioni retoriche e ad 
improvvisi arresti, questo inseguirsi di idee nuove si accompagna alla libertà 
del lessico pieno di elementi volgari trasportati in latino: bilanx, ad differen- 
tiam, peripheria, ad minus, media « mezzi », ecc. (5). 

Nel Sidereus Nuncius il tono di acuta polemica lascia il campo a descrizioni
ampie e in qualche punto commosse (III, p. 59):


«Magnum sane est, supra numerosam inerrantium Stellarum multitudinem,
quae naturali facultate in hunc usque diem conspici potuerunt,
alias innumeras superaddere oculisque palam exponere, antehac conspectas
numquam, et quae veteres ac notas plusquam supra decuplam
multiplicitatem superent.

Pulcherrimum atque visu iucundissimum est, lunare corpus, per sex 
denas fere terrestres semidiametros a nobis remotum, tam ex propinquo 
intueri, ac si per duas tantum easdem dimensiones distaret ». 


In queste parole sembra palpitare la commozione dei nuovi orizzonti aperti 
alla visione dell’uomo. Tale accento si avverte anche là dove la descrizione si 
fa più minuta e più rigorosamente scientifica (III, p. 62): 


«De facie autem Lunae, quae ad aspectum nostrum vergit, primo 
loco dicamus. Quam, facilioris intelligentiae gratia, in duas partes di- 
stinguo, alteram nempe clariorem, obscuriorem alteram: clarior videtur 
totum hemisphaerium ambire atque perfundere, obscurior vero, veluti 
nubes quaedam, faciem ipsam inficit maculosamque reddit. Istae autem
maculae, subobscurae et satis amplae, unicuique sunt obviae, illasque
aevum omne conspexit; quapropter magnas, seu antiquas, eas appellabimus,
ad differentiam aliarum macularum amplitudine minorum, at
frequentia ita consitarum, ut totam Lunarem superficiem, praesertim
vero lucidiorem partem, conspergant; hae vero a nemine ante nos observatae
fuerunt ».

(5) V. R. SPONGANO: op. cit., p. 102.


Già abbiamo accennato alle movenze latine delle prime opere volgari. Ecco
come incomincia La bilancetta (I, p. 215): 


« Sì come è assai noto a chi di leggere gli antichi scrittori cura si
prende, avere Archimede trovato il furto dell’orefice nella corona d’oro di
Ierone, così parmi esser stato sinora ignoto il modo che sì grand’uomo
usar dovesse in tale ritrovamento; atteso che il credere che procedesse,
come da alcuni è scritto, co ’l mettere tal corona dentro a l’aqqua, avendovi
prima posto altrettanto di oro purissimo e di argento separati, e
che dalle differenze del far più o meno ricrescere o traboccare l’aqqua
venisse in cognizione della mistione dell’oro con l’argento, di che tal
corona era composta, par cosa, per così dirla, molto grossa e lontana dall’esquisitezza ».


Il periodo, con la sua paludata e retorica complessità, arieggia in tutto un 
periodo latino: ci sono gli infiniti con l’accusativo, i verbi principali sono posti 
in fondo e questo procedimento d’inversione assurge quasi a sistema. Se 
GALILEO avesse costantemente seguita questa via non sarebbe certo da anno- 
verare fra i maggiori scrittori italiani. 

Ad ogni modo, già nella sua esperienza di scrittore latino e nelle sue prime 
scritture volgari, GALILEO si mostra aderentissimo alle cose e, se non proprio 
immune da retorica, soltanto ai margini di essa. La parola è in dipendenza 
diretta del pensiero e lo rappresenta con immediatezza. Il pensiero non cede 
di fronte alla lingua ma la informa arditamente di sé. 

Ma è ora di affrontare il quesito fondamentale che si pone parlando di 
GALILEO scrittore. In che cosa consiste l’importanza di GALILEO nella nostra 
storia letteraria? Le opinioni dei critici sono discordi. Abbandonato col BOSCO
il mito di GALILEO creatore della prosa scientifica italiana, vi sono stati finora
due modi di intendere GALILEO scrittore: o col SAPEGNO (6), come continuatore
della tradizione cinquecentesca, o, con lo SPONGANO (7), come un’apparizione
nuova, originale, senza nulla in comune con gli scrittori del Cinquecento, se 
si esclude il mezzo di espressione, la lingua toscana. Queste due diverse posi- 
zioni, validamente sostenute dai due illustri critici, si fondano su un problema 
che va chiarito. 


(6) In Galileo scrittore (Atti e Mem. Accad. dell’Arcadia, 1948).
(7) Op. cit.

Parlare di Cinquecento in forma astratta, in generale, come se costituisse
una categoria assoluta, non pare legittimo. Vi è un Cinquecento del MACHIAVELLI
e del GUICCIARDINI e un Cinquecento del BEMBO e del CASTIGLIONE:
vi è un Cinquecento dell’ARETINO e del DONI e un Cinquecento di LEONARDO
e del CELLINI. Vi sono nel Cinquecento scrittori sinceri ed immediati, ve ne 
sono altri immersi nella retorica, e non soltanto in quella di stile elevato, ma 
anche in quella (meno osservata ma non meno reale) popolaresca. Il Cinque- 
cento ha una corrente latineggiante e boccaccevole che si sviluppa fuori di 
Toscana ed una corrente più fresca, propria di autori toscani, tanto che un 
giovane studioso ha potuto recentemente parlare (8) di un edonismo linguistico
del Cinquecento.

Considerando partitamente gli scrittori del Cinquecento sarebbe molto dif- 
ficile stabilire una filiazione di GALILEO; se, d’altra parte, ci riportiamo ad 
ideali di misura e di chiarezza, se facciamo questione di gusto e di decoro e 
confrontiamo il suo stile con quello di BRUNO e di CAMPANELLA, allora i legami 
col Cinquecento appaiono innegabili. In questo caso si comprende come la 
pagina notissima del VIVIANI, citata da tutti i critici, abbia sapore e valore 
anche per il critico attuale per scoprire la poetica di GALILEO (XIX, p. 627): 


«Fu dotato dalla natura d’esquisita memoria; e gustando in estremo 
la poesia, aveva a mente, tra gl’autori latini, gran parte di Vergilio, 
d’Ovidio, Orazio e di Seneca, e tra i toscani quasi tutto ’l Petrarca, tutte
le rime del Berni e poco meno che tutto il poema di Lodovico Ariosto,
che fu sempre il suo autor favorito e celebrato sopra gl’altri poeti, avendogli
intorno fatte particolari osservazioni e paralleli col Tasso sopra
moltissimi luoghi.... Parlava dell’Ariosto con varie sentenze di stima
e d’ammirazione; et essendo ricercato del suo parere sopra i due poemi
dell’Ariosto e del Tasso, sfuggiva prima le comparazioni, come odiose,
ma poi, necessitato a rispondere, diceva che gli pareva più bello il Tasso,
ma che gli piaceva più l’Ariosto, soggiugnendo che quel diceva parole,
e questi cose. E quand’altri gli celebrava la chiarezza et evidenza nel- 
l’opere sue, rispondeva con modestia, che se tal parte in quelle si ritro- 
vava, la riconosceva totalmente dalle replicate letture di quel poema, 
scorgendo in esso una prerogativa solo propria del buono, cioè che quante 
volte lo rileggeva, sempre maggiori vi scopriva le meraviglie e le perfe- 
zioni ». 


Lo scrittore prediletto di GALILEO è, dunque, un poeta, ARIOSTO, e questo 
suo orientamento è rivelatore. La congenialità del grande scienziato col poeta 
del Rinascimento, è fondata su elementi di gusto, di quel gusto che poneva,
fra GALILEO e il TASSO, una barriera di incomprensione, che faceva tacere nel
giudizio critico di GALILEO i lati positivi del poeta della Gerusalemme coi suoi
inquieti chiaroscuri, col corrispondersi continuo delle passioni intime col mondo
reale, col contrappunto di nuove melodie e risonanze ignote al classicismo del
primo Cinquecento.

(8) C. SEGRE, in Giorn. st. lett. ital., 70, 1953, p. 14.

Di chiarezza e di evidenza si parla nel VIVIANI: è lo spirito fiorentino contro 
una forma di decadentismo che tende verso morbidezze a GALILEO ignote o 
sgradite. Nella sua avversione per gli « arzigogoli », gli « scambietti metafisicali »,
le « capriole intrecciate » del TASSO, in questa sua pedanteria nella valutazione
di un pur grande poeta, troviamo l'orientamento sicuro per penetrare nello 
studio dello stile galileiano. 

Non sarà neppur male notare che, in qualche caso, egli dà sulla voce persino
al prediletto ARIOSTO, là dove in minime giunture, in rari accostamenti
di parole gli pare poco spontaneo, poco chiaro e alquanto duro. Ad esempio
«per fuggire la durezza del secondo verso e la continuazione di otto i» propone
(IX, p. 185) di mutare il verso ariostesco (Orlando Furioso, XXXVII, 82):


«Da le mogli così furo i mariti 

da le madri così i figli divisi » 
in questo modo: 

« E così dalle mogli i lor mariti 

e i figli dalle madri fur divisi ». 


Qui pare di sentire GALILEO musico insorgere un po’ pedantescamente contro
una cacofonia dell'amato ed ammirato poeta. E non vi è dubbio che, fra i
caratteri estrinseci della prosa galileiana, vi è una accurata sapienza stilistica, 
una chiarezza, una sostenuta armonia, un tono sempre identicamente digni- 
toso pur nella varietà degli atteggiamenti. 

A comprendere la sua prosa, servono anche i suoi errori estetici, come segno 
di gusto e come presupposto del suo concreto modo di scrivere. 

Anche il fatto che GALILEO, a detta del VIVIANI (XIX, p. 625), « era... provvisto
di pochissimi libri», aiuta a comprendere la singolarità della sua composizione.
Perché, se è vero, come dice il MOMIGLIANO che «la storia generale della
nostra prosa è anzitutto storia di classicismo e di retorica » (9), è anche vero che
alcuni fra i nostri autori più grandi furono immuni dalla retorica e retorico non
fu GALILEI (10). A tenerlo lontano dalla retorica fu la sua mente scientifica, il
suo restare a contatto continuo con la realtà, il suo ingenito senso della misura
delle cose.


(9) Elzeviri, pp. 50-56. 

(10) Una ricerca degli elementi barocchi nella prosa di GALILEI, ad opera di GIULIO
MARZOT in Convivium (1954, pp. 678-689 e 1955, p. 43), così si conclude: « Noi riteniamo
che il GALILEI sia da dirsi scrittore di evidenza barocca: ma barocco solo per
quel tanto che la classicità, vale a dire l'armonia dei suoi spiriti e modi, lo consentiva ».

Lo spirito di osservazione si tramuta in lui, come dice il LEOPARDI, in
« scolpitezza evidente » (11), giudizio, sia pure accomunante GALILEO col REDI,
che riscatta quell’ambigua osservazione (« Il nostro Galileo lo chiami elegante
chi non conosce la nostra lingua »), che si rifà ad una infelice frase del GIORDANI
che era venuto fuori, parlando dell’« acutissimo ed elegantissimo ingegno »
del Cardinale PALLAVICINO, con queste parole: « Al sommo Galileo sovrabbondò
la mente, ma parve quasi mancare lo studio pell’opera di scrivere; in quella copia
diffuso e soverchio, talora languido, talora confuso » (12). Ma nello stesso LEOPARDI
vi è un altro giudizio, contrastante singolarmente con questo: « Di questa
associazione della precisione coll’eleganza, è splendido esempio lo stile di Celso
e fra’ nostri di Galileo » (13).

Da tale contraddizione è possibile arguire l’incertezza del giudizio critico
per la singolarità stessa della prosa galileiana. La quale sorge, non già su una
tradizione formata (almeno per quanto ogni prosa si rifà a una scuola), ma
su una nuova visione del mondo e accompagna fedelmente le speculazioni
scientifiche e filosofiche con larghezza e chiarezza di impostazione e con una
singolare capacità di mantenersi sempre ad un livello elevatissimo. Egli diceva
che non si possono unire «i fioretti retorici con la saldezza delle dimostrazioni »
ed aggiungeva (VI, p. 330):


«Io non so vedere perché, potendo uno dir bene assolutamente con 
una semplicissima e propriissima parola, ei debba servirsi d’una impropria 
e bisognosa di limitazioni ed insomma d’esser finalmente trasportata in 
un’altra molto diversa ». 


Si è detto che il ragionamento del MACHIAVELLI procede secondo una forma 
di catena; altrettanto avviene del ragionamento di GALILEO in cui si notano, 
però, anelli solidamente ribattuti. I suoi periodi sono lunghissimi e la più 
genuina ed immediata impressione che si ha leggendoli è come di meraviglia 
perché quella complessità di ragionamento non stanca mai: tutto si regge su
un’armonia e su un equilibrio che ha quasi il valore di una forza della natura.
Le esemplificazioni si possono trovare in ogni pagina del Saggiatore, dei Massimi
sistemi, delle Nuove scienze. Leggiamo questo passo del Saggiatore (VI, p. 251):


«... Qualunque volta sia presente l’effetto, necessariamente vi è anco
quella causa. E così, per darne un esempio, chi dicesse, “ Il tale ha acceso
il fuoco, adunque si è servito dello specchio ustorio ”, errerebbe, potendo
derivar l’accendimento dal batter un ferro, dall’esca e fucile, dalla confricazione
di due legni, e da altre cause; ma chi dicesse “ Io ho sentito
batter il fuoco al vicino ”, e soggiungesse “ Adunque egli ha della pietra
focaia ”, senza ragione sarebbe ripreso da chi gli opponesse che, concorrendo
a tale operazione, oltre alla pietra, il fucile, l’esca e ’l solfanello
ancora, non si poteva con buona logica inferir la pietra risolutamente.
E così, se l’ingrandimento dell’angolo e l’union de’ raggi concorrono
sempre nell’operazioni del telescopio, delle quali una è il far veder l’invisibile,
perchè da questo effetto non si può inferire quale delle due cause
più ne piace? Io credo di penetrare in parte la mente del Sarsi, il quale,
s’io non m’inganno, vorrebbe che il lettore credesse quello ch’egli stesso
assolutamente non crede, cioè ch’il veder le stelle, che prima erano invisibili,
derivasse non dall’ingrandimento dell’angolo, ma dall’unione de’
raggi; sì che, non perchè la specie di quelle divenisse maggiore, ma perchè
i raggi fussero fortificati, si facesser visibili: ma non si è voluto apertamente
scoprire, perchè troppo gli sono addosso l’altre ragioni del Sig. Mario
taciute da esso, ed in particolare quella del vedersi gl’intervalli tra stella
e stella ampliati colla medesima proporzione che gli oggetti quaggiù
bassi; i quali intervalli non dovrian ricrescer punto, se niente ricrescessono
le stelle, essendo loro così distanti da noi come quelle ».

(11) Zibaldone, ediz. curata da F. FLORA, ease, a
(12) Op. cit., II, p. 65 e la nota a p. 1342.
(13) Op. cit., I, p. 886.


È difficile, credo, trovare uniti nello stesso periodo, come avviene in GALILEO 
ad ogni pagina, un’onda di proposizioni così ampia e nello stesso tempo così 
chiara. Il discorso è pieno, non lascia sottintendere nulla, ogni cosa è espressa 
con pari minuziosa cura, tutto è incatenato sapientemente e la lettura procede 
con quel senso di compiutezza e quasi di dovizia che non cade mai nell’eccessivo. 
Il freno è dato da quel sentimento del limite che deriva dal gusto dell’autore 
e dall’insegnamento delle cose dalle quali germoglia una lezione di misura e 
di proporzione. 

Nella prosa di GALILEO, anche le sentenze sono di per sè mirabili. Splen- 
dida a me sembra la seguente (VI, p. 340): 


«Se il discorrere circa un problema difficile fusse come il portar pesi, 
dove molti cavalli porteranno più sacca di grano che un caval solo, io 
acconsentirei che i molti discorsi facessero più che un solo; ma il discorrere
è come il correre, e non come il portare, ed un caval berbero solo
correrà più di cento frisoni ». 


Tuttavia quasi sempre esse perdono di vigore e di sapore se sono staccate
dal contesto perchè, in generale, si incorporano indissolubilmente al tessuto
(VI, p. 237):


«Ma pur fussero i veri filosofi come l’aquile, e non più tosto come la 
fenice! Signor Sarsi, infinita è la turba degli sciocchi, cioè di quelli che 
non sanno nulla; assai son quelli che sanno pochissimo di filosofia; pochi 
son quelli che ne sanno qualche piccola cosetta; pochissimi quelli che ne 
sanno qualche particella; un solo Dio è quello che la sa tutta. Sì che,
per dir quel ch’io voglio inferire, trattando della scienza che per via di
dimostrazione e di discorso umano si può dagli uomini conseguire, io
tengo per fermo che quanto più essa participerà di perfezzione, tanto 
minor numero di conclusioni prometterà d’insegnare, tanto minor numero 
ne dimostrerà, ed in conseguenza tanto meno alletterà, e tanto minore 
sarà il numero de’ suoi seguaci ». 


Il rivolgersi direttamente all’avversario che vuol confutare è uno dei pro- 
cedimenti più frequenti nello stile di GALILEO e si trova non solo nelle opere 
maggiori come il Saggiatore ma anche in quelle annotazioni marginali e in 
quelle postille che aprono la via a comprendere i ragionamenti di maggior 
impegno. 

La polemica e l’ironia si accompagnano spesso a queste movenze stilistiche 
e talvolta lo sdegno prorompe e il sarcasmo giunge ben a fondo nelle carni 
dell’oppositore. Celebre è quel passo contro ANTONIO ROCCO, nelle note marginali
alle Esercitazioni filosofiche (VII, p. 270):


« Deh non avessi io mai scoperte queste novità in cielo, di tante innu- 
merevoli non prima vedute stelle fisse, di quel che siano le nebulose, la 
Via Lattea, de’ collaterali di Saturno, della corte di Giove, la immensa 
mutazione di grandezza in Marte, le importune macchie nel Sole, le grandi 
mutazioni di figura e grandezza in Venere, le scabrosità grandissime nella 
Luna; deh mai io non l’avessi palesate al mondo poi che dovevano concitarmi
l’odio del signor Antonio Rocco e di tanti altri signori filosofi!
Consolatevi, Signore; ché il tempo, scopritore della verità, in breve è 
per estirpare queste fallacie, e più le vane conseguenze che io stoltamente 
ne deducevo, e i vostri scritti, pieni di dottrina ferma e soda, viveranno 
immortali, ad onta delle mie esorbitantissime chimere ». 


Lo stile di GALILEO, sempre tenuto su un piano altissimo, si accende, nel- 
l’urto polemico immediato, fino al sarcasmo e alla passionalità che arriva all’in- 
giuria. A proposito di un’osservazione di LUDOVICO DELLE COLOMBE (autore 
di un discorso contro il moto della terra), GALILEO annota (III, p. 258): 


«Sarebbero matti [i copernicani] a rispondere a queste pippionate ». 


L’ironia e la malizia sorgono sempre quando il pacato ragionare non ha
fatto presa su menti ostinate a non voler capire e su occhi che si rifiutano di 
vedere, come nel caso di quel peripatetico che, assistendo ad una operazione 
anatomica e constatando che i nervi si originano nel cervello, dice (VII, p. 134): 


«Voi mi avete fatto veder questa cosa talmente aperta e sensata, che 
quando il testo di Aristotile non fusse in contrario, che apertamente dice, 
i nervi nascer dal cuore, bisognerebbe per forza confessarla per vera ».

O altrove (III, p. 398):


«Tu di’ che ne i sensibili comuni non si deve credere a un senso senza 
l’attestazione dell’altro; adunque tu vuoi creder più a due testimoni falsi
che a un solo »; 


in cui, quel rivolgersi direttamente all’interlocutore fa venire in mente le im- 
provvise apostrofi di LEONARDO quando interpella un presunto lettore: 


« Nissuna umana investigazione si pò dimandare vera scienzia, s’essa 
non passa per le matematiche dimostrazioni e se tu dirai che le scienzie, 
che principiano e finiscono nella mente, abbiano verità, questo non si 
concede... ». 


La prosa di GALILEO è così il frutto di una mente di singolari capacità 
creative in cui la concatenazione logica non porta a monotonia o a scorie di 
struttura, in cui il tono commosso di fronte alle nuove visioni della natura 
si unisce alla ferma obiettività del ragionamento e la integra. 

Il culmine letterario della prosa galileiana è il Saggiatore, dove il calore della 
polemica e l'evidenza delle immagini, il procedimento logico e l'evasione poe- 
tica, il passare dalla sentenza al motto di spirito, l’incalzare l’avversario e 
l’abbandonarlo per inseguire una verità improvvisamente balenata danno una
tale varietà alla pagina da porre la grandezza di GALILEO scrittore accanto a 
quella di GALILEO scienziato. Fin dalle prime pagine tali qualità sono evidenti. 
In quella sentenza (VI, p. 214):


«il contraddire alla geometria è un negare scopertamente la verità »,


par di sentire ancora una volta LEONARDO con le sue lodi alle « matematiche 
dimostrazioni » e il suo anelito verso la verità. 

Lo stesso piglio di alcune locuzioni: « lasciando il suo scolare, come si dice, 
nelle peste », oppure: « spero d’aver a fare uscir voglia ad alcuno di molestare 
(come si dice) il mastino che dorme, e voler briga con chi si tace » con quelle 
attenuazioni in proposizioni incidentali, dà una nota di colore all’inizio del 
trattato e sembra il sorriso di chi sa di avere già vinto. Così nel motteggiare
sul titolo dell’opera del SARSI (Libra astronomica ac philosophica), dice (VI,
p. 220):


«Ma perchè m’è paruto che, nel ponderare egli le proposizioni del 
Sig. Guiducci, si sia servito d’una stadera un poco troppo grossa, io ho 
voluto servirmi d’una bilancia da saggiatori, che sono così esatte che 
tirano a meno d’un sessantesimo di grano ». 


Ma nella fine del capitolo il periodo diventa più severamente rilevato per 
una osservazione del SARSI a proposito dello Scorpione (VI, p. 221): 


«Adunque molto più proporzionatamente, ed anco più veridicamente, 
se riguarderemo la sua scrittura stessa, l’avrebbe egli potuta intitolare:
L’Astronomico e filosofico Scorpione, costellazione dal nostro sovran
poeta Dante chiamata figura del freddo animale “ che colla coda percuote 
la gente”; e veramente non vi mancano punture contro di me, e tanto 
più gravi di quelle degli scorpioni, quanto questi, come amici dell’uomo, 
non feriscono se prima non vengono offesi e provocati, e quello morde 
me che mai nè pur col pensiero non lo molestai. Ma mia ventura, che 
so l’antidoto e rimedio presentaneo a cotali punture! Infragnerò dunque
e stropiccerò l’istesso scorpione sopra le ferite, onde il veleno risorbito 
dal proprio cadavero lasci me libero e sano ». 


In questo periodo l’evidente realismo dell'immagine ha una potenza dan- 
tesca. Un’apertura solenne assume il discorso quando si parla della natura 
(VI, p. 232):


« Parmi, oltre a ciò, di scorgere nel Sarsi ferma credenza, che nel filo- 
sofare sia necessario appoggiarsi all’opinioni di qualche celebre autore, 
sì che la mente nostra, quando non si maritasse col discorso d’un altro, ne 
dovesse in tutto rimanere sterile ed infeconda; e forse stima che la filoso- 
fia sia un libro e una fantasia d’un uomo, come l’Iliade e Orlando Furioso, 
libri ne’ quali la meno importante cosa è che quello che vi è scritto sia 
vero. Sig. Sarsi, la cosa non istà così. La filosofia è scritta in questo 
grandissimo libro che continuamente ci sta aperto innanzi a gli occhi 
(io dico l’universo), ma non si può intendere se prima non s’impara a 
intender la lingua, e conoscere i caratteri, ne’ quali è scritto. Egli è scritto 
in lingua matematica, e i caratteri son triangoli, cerchi, ed altre figure 
geometriche, senza i quali mezi è impossibile a intenderne umanamente 
parola; senza questi è un aggirarsi vanamente per un oscuro laberinto ». 


Nei Massimi sistemi il procedimento dialettico assume forma dialogica, il
che permette una maggior varietà stilistica e la costruzione di caratteri, di
personaggi che s’imprimono nella memoria del lettore con una loro netta individualità.

L'osservazione dei fenomeni naturali si amplia in un orizzonte più vasto, 
in temi più grandi che portano a scrutare l’universo nella sua miracolosa immensità.

In qualche caso l’aggettivazione si fa ricca, complessa, vuole accompagnare 
la dimostrazione in ogni particolare anche minimo perché nulla sfugga all’occhio 
esperto dell’osservatore (VII, p. 132): 


« Per quello dunque che mi è restato in mente, fu la somma dei discorsi 
di ieri andar esaminando da i fondamenti loro, qual delle due opinioni 
sia più probabile e ragionevole: quella che tiene, la sustanza de i corpi 
celesti esser ingenerabile, incorruttibile, inalterabile, impassibile, ed in 
somma esente da ogni mutazione, fuor che dalla locale, e però essere
una quinta essenza diversissima da questa de i nostri corpi elementari,
generabili, corruttibili, alterabili, etc. ».


Lo stupore per le opere della natura si esprime talvolta con una serie di 
interrogazioni che lasciano come un’eco profonda, uno strascico di risonanze 
e di armonie (VII, p. 128): 


«Non direm noi che ’l sapere scoprire in un marmo una bellissima
statua ha sublimato l’ingegno del Buonarruoti assai sopra gli ingegni 
comuni degli altri uomini? E questa opera non è altro che imitare una 
sola attitudine e disposizione di membra esteriore e superficiale d’un uomo 
immobile; e però che cosa è in comparazione d’un uomo fatto dalla natura, 
composto di tante membra esterne ed interne, de i tanti muscoli, tendini, 
nervi, ossa, che servono a i tanti e sì diversi movimenti? Ma che diremo 
de i sensi, delle potenze dell’anima, e finalmente dell’intendere? non pos- 
siamo noi dire, e con ragione, la fabbrica d’una statua cedere d’infinito 
intervallo alla formazion d’un uomo vivo, anzi anco alla formazion d’un 
vilissimo verme? ». 


Anche qui credo che non si vada errati se si richiama l’ammirato stupore 
di fronte alla natura di LEONARDO DA VINCI. 

Fra le figure più umanamente vere degli scritti di GALILEO spicca Simplicio, 
attaccato ad una tradizione che non si rassegna a morire, ostinato come don 
Ferrante e sempre pronto a trovar mille cavilli, ad opporre ragioni, anche se 
poco valide, alla ricerca di una verità che ormai travolge e trionfa. Il suo pro- 
cedere sillogistico, che non arriva ad arginare l’incalzante ragionamento dei 
due interlocutori, talvolta si esprime con alternative che vorrebbero esser 
conclusive ma non sono; né è raro il caso che nell’affollato suo stile scaturisca 
qualche forma popolaresca (VII, p. 128): 


«O io non sono un di quegli uomini che intendano, o ’n questo 
vostro discorso è una manifesta contradizione. Voi tra i maggiori 
encomii, anzi pur per il massimo di tutti, attribuite all'uomo, fatto
dalla natura, questo dell’intendere; e poco fa dicevi con Socrate che ’l
suo intendere non era nulla; adunque bisognerà dire, che né anco la 
natura abbia inteso il modo di fare un intelletto che intenda », 


in cui quel dicevi attribuito alla seconda persona plurale è una forma eminen- 
temente popolaresca. 

Nonostante il loro atteggiamento di battaglia, resta, tuttavia, indubbio 
che i Massimi sistemi ha uno stile più disteso del Saggiatore, più ampiamente 
discorsivo, più calmo e meno nervoso.

In qualche punto viene in mente PLATONE, come nel passo seguente in
cui un accenno alla natura o all’ora del giorno fa ritornare sulla terra la mente
estatica e perduta nella contemplazione dei prodigi dell’ingegno umano
(VII, p. 130):


«Io son molte volte andato meco medesimo considerando, in 
proposito di questo che di presente dite, quanto grande sia l’acutezza 
dell’ingegno umano; e mentre io discorro per tante e tanto meravigliose 
invenzioni trovate da gli uomini, sì nelle arti come nelle lettere, e poi 
fo reflessione sopra il saper mio, tanto lontano dal potersi promettere
non solo di ritrovarne alcuna di nuovo, ma ancora di apprendere delle 
già ritrovate, confuso dallo stupore ed afflitto dalla disperazione, mi 
reputo poco meno che infelice. S’io guardo alcuna statua delle eccellenti, 
dico a me medesimo: E quando sapresti levare il soverchio da un pezzo 
di marmo, e scoprire sì bella figura che vi era nascosa? Quando mesco- 
lare, e distendere sopra una tela o parete colori diversi, e con essi rap- 
presentare tutti gli oggetti visibili, come un Michelangelo, un Raffaello, 
un Tiziano? S’io guardo quel che hanno ritrovato gli uomini nel com- 
partir gl’intervalli musici, nello stabilir precetti e regole per potergli 
maneggiar con diletto mirabile dell’udito, quando potrò io finir di stu- 
pire? Che dirò de i tanti e sì diversi strumenti? La lettura de i Poeti 
eccellenti di qual meraviglia riempie chi attentamente considera l’inven- 
zion de’ concetti e la spiegatura loro? Che diremo dell’architettura? che 
dell’arte navigatoria? Ma sopra tutte le invenzioni stupende, qual emi- 
nenza di mente fu quella di colui che s’immaginò di trovar modo di 
comunicare i suoi più reconditi pensieri a qualsivoglia altra persona, 
benché distante per lunghissimo intervallo di luogo e di tempo? parlare 
con quelli che sono nell’Indie, parlare a quelli che non sono ancor nati 
né saranno se non di qua a mille e dieci mila anni? e con qual facilità? 
con i vari accozzamenti di venti caratteruzzi sopra una carta. Sia questo 
il sigillo di tutte le ammirande invenzioni umane, e la chiusura de’ nostri 
ragionamenti di questo giorno: ed essendo passate le ore più calde, il 
signor Salviati penso io che avrà gusto di andare a godere de i nostri freschi 
in barca; e domani vi starò attendendo amendue per continuare i discorsi
cominciati ».


In the dialogues of the Scienze Nuove the tendency toward a more specifically
scientific discourse gains the upper hand. The characters, too, now have an
almost purely structural function. We are approaching the type of modern
scientific prose that aims to arrive directly at the demonstrations.

In the Scienze Nuove the periods are still harmonious, but what is missing is
that moved human participation which made the pages of the Saggiatore so
evocative.


The impression of measure, proportion and harmony that one has in reading
GALILEO's works is confirmed on closer consideration of his attitude
toward terminology, acutely studied by MIGLIORINI (14). A scientist
among the greatest and most fertile in new discoveries, among the richest
in intuitions, GALILEO introduced no new terms into the Italian
vocabulary. He accepts without protest the Greek and Latin words that had already
entered the vocabulary of science, such as sesquialtero, prostaferesi, parallasse,
ipogeo, transonoro, but he has a deep reluctance, almost an inhibition,
to coin new terms. To meet the terminological needs of his
discoveries, GALILEO uses words already in use, giving them a technical meaning.
This is the case of « pendolo » or of « ancora » (for arming the lodestone), words
that long pre-date GALILEO and are, so to speak, made technical and put to a new
scientific use. In GALILEO's pages one can often follow the passage of a
term from its general to its technical value (15).

This attitude of GALILEO's entails the need to define and fix the
terms in their new meaning, and gives his page a more human and open
breath. He never wished to sacrifice to a label the broad and sure
phrasing, the airy amplitude of his page. Thus it happens that even the most
characteristic Galilean inventions were christened by others.

Characteristic in this respect is the history of the word telescopio. Immediately
after the discovery GALILEO called his invention cannone or occhiale, then
cannocchiale, a formation which is, in truth, not a happy one. In the Sidereus
Nuncius GALILEO used perspicillum; KEPLER substituted conspicillum for it, and
Father SCHEINER (Apelles) helioscopium. It was FEDERICO CESI who, in 1611 (if not
earlier), introduced telescopio. From then on GALILEO too adopted this term, which
later became general. The microscopio, another of GALILEO's glories, was so named
by GIOVANNI FABER, a Lincean academician from Bamberg; the termoscopio owes its
name to BIANCANI (1620) and then became termometro through Father LEURECHON; the
cicloide received its name from Father MERSENNE.

GALILEO held firm to this attitude throughout his activity as
writer and scientist. What he called the bilancetta was named idrostammo
by the Accademici del Cimento, and it has been shown (16) that many
words which appear in the Crusca and in TOMMASEO as first attested
in GALILEO belong to his predecessors: ascensionale is already in COSIMO
BARTOLI, gravità in TARTAGLIA, nebulosa in EGNAZIO DANTI, just as selinografia
and selinografo go back to BACON (17).

Just as GALILEO created no new terms, so he helped to bury a good
many: a whole mass of errors is inexorably struck down.


(3) KO PARCO Dts 

(15) For « pendolo », see Massimi sistemi in VII, pp. 177 and 256.
(16) From MIGLIORINI, op. cit., p. 150.

(17) Novum Organum (1620), II, XXXIX: « selenographia ».
Thus the study of GALILEO's terminology also ends with a
lesson in measure. This is precisely one of the traits that distinguish
GALILEO's language from that of his pupils and followers. The grandeur, the poetic
afflatus always rigorously measured and controlled, becomes in the scientific
prose of the seventeenth century, in VIVIANI, CASTELLI, MAGALOTTI, REDI and
BELLINI, decorum, and even pleasantness and color, or an embroidery of images not
free from academicism. The effectiveness and the dignified clarity remain; gone
are the magnanimous emotion and the admiring wonder of one who first revealed
to mankind new worlds and most distant horizons, of one who spent his whole life
in the discovery and contemplation of new truths.

GALILEO's prose, born in an initial symbiosis of Latin and the vernacular,
attained, in the discoveries he made day by day, a moved perfection,
a vivacity and a robustness such as are very rarely met with in our
writers. The ductility of his means of expression almost leads us to identify
his scientific reality with a fabulous world in which nebulae and stars, physical
laws and mathematical observations are clothed in a halo of legend. In the
happiest pauses of his observation of the immensity of the heavens and of infinite
space, and in the contemplation of the smallest beings, GALILEO is always present
with his most acute mind (XIII, p. 208):


«Io ho contemplati [col microscopio] moltissimi animalucci con infi- 
nita ammirazione: tra i quali la pulce è orribilissima, la zanzara e la 
tignuola son bellissimi; e con gran contento ho veduto come faccino le 
mosche et altri animalucci a camminare attaccati a’ specchi, et anco 
di sotto in su.... In somma ci è da contemplare infinitamente la gran- 
dezza della natura, e quanto sottilmente ella lavora e con quanta indi- 
cibil diligenza ». 


Or again (X, p. 280):


«Io mi trovo al presente in Venezia per fare stampare alcune osser- 
vazioni, le quali col mezo di un mio occhiale ho fatte ne i corpi celesti; 
et sì come sono di infinito stupore, così infinitamente rendo grazie a Dio, 
che si sia compiaciuto di far me solo primo osservatore di cosa ammi- 
randa et tenuta a tutti i secoli occulta ». 


Then, for those eyes that had ranged through the infinite, came the most cruel
sentence: blindness. In January 1638 GALILEO announced his misfortune to DIODATI
thus (XVII, p. 247):


«Signor mio, il Galileo, vostro caro amico e servitore, è fatto irre- 
parabilmente da un mese in qua del tutto cieco. Or pensi V. S. in quale 
afflizione io mi ritrovo, mentre che vo considerando che quel cielo, quel 
mondo e quello universo che io con mie maravigliose osservazioni e chiare 
dimostrazioni avevo ampliato per cento e mille volte più del comunemente
veduto da’ sapienti di tutti i secoli passati, ora per me s’è sì diminuito
e ristretto, ch’e’ non è maggiore di quel che occupa la persona
mia ».


It has been said that no scientist of modern times is at the same time
so great a writer as GALILEO. These words, so full of pathos and so
restrained in their grief, persuade us all the more of the greatness of his
soul and bring us closer to his tormented yet serene humanity, without which
one could not explain the singular greatness of his style and the unforgettable,
irreplaceable value of his experience as scientist and writer.
«O frati,» dissi, «che per cento milia
perigli siete giunti all’occidente, 
a questa tanto picciola vigilia 


de’ vostri sensi, ch’é del rimanente, 
non vogliate negar l’esperienza, 
diretro al sol, del mondo sanza gente. 


Considerate la vostra semenza: 
fatti non foste a viver come bruti, 
ma per seguir virtute e canoscenza ». 


DANTE, Divina Commedia,
Inferno, XXVI, 112-120.


With the death of ALBERT EINSTEIN on April 18, 1955, a world-figure 
leaves the scene who will be remembered by future historians as the most 
profound genius of our era and one of the greatest thinkers of all ages. He 
opened to the human mind new perspectives of unprecedented grandeur and 
demonstrated the comprehensibility of the physical universe to a degree which 
was not surmised before. 

The present article is not devoted to the man EINSTEIN, to the human 
personality who was of extraordinary stature in itself. The many popular 
articles he wrote on urgent ethical, political and philosophical issues of our 
days [42, 43] show the same earnestness and penetrating power of the mind 
that characterize his scientific writings. They are, however, outside the scope 
of our present discussions which are solely devoted to EINSTEIN the physicist. 
But even as a physicist he was the originator of so many profound disco- 
veries that their due evaluation would require a volume. If we concentrate 
here on a brief sketch of his scientific achievements, we do that in the hope 
that the very brevity of the enumeration will give evidence of the astonishing
scope of his scientific results.

If one studies the scientific output of ALBERT EINSTEIN, one is immediately
struck by the dual strain of his meditations: EINSTEIN, the ingenious statis- 
tician, and EINSTEIN, the ingenious geometer. Here are two modes of thinking 
which are very seldom united in one person’s mind to such an outstanding 
degree. But in the magnificent early investigations of EINSTEIN both strains 
run parallel and his contributions in both directions were equally profound. 
While it was the dazzling discovery of general relativity which singled him 
out to that pedestal that he will occupy forever in the history of scientific 
and philosophical thought, yet the development of contemporary physics in 
its quantum aspects was even more strongly influenced by his penetrating analyses
of the radiation-matter problem. If we deal here more in detail with his rela- 
tivistic speculations, we do that partly because he himself turned in the second 
half of his life more and more exclusively to this field of thought, and partly 
because the final impact of his genius on man’s intellectual evolution will in 
all probability reside in that realm. It would be unjust, however, to omit
altogether that side of EINSTEIN’s work which would have made him one of 
the leading physicists of his era, even without any of his relativistic discoveries. 

EINSTEIN belonged to that group of great scientists who like to submerge 
in a problem unprejudiced by the opinions of other workers in the field. The 
mere reading of literature and the mere absorption of scientific results in
ready-made form give them little satisfaction (a habit frequently shared by lesser
minds). It is not so surprising that the university student EINSTEIN did not
excel to any undue degree. He was not interested in the essentially formalistic 
mathematical courses to which he was exposed. He was not attracted to the 
intricacies of mathematical formalism and mathematics for its own sake did 
not excite his enthusiasm — in marked contrast to his last years, when he was 
inclined to trust purely formal relations perhaps too strongly. His mind was 
glued on the physical structure of the universe and he developed an almost 
uncanny intuition which allowed him to roam freely in the land of physical 
concepts, deducing new relations by a most sparing use of mathematical tools. 
EINSTEIN never cared for techniques and never cared for details which had 
no relation to something fundamental. This put his papers in a special cate- 
gory and was responsible for the peculiar under-estimation which many scientists 
extended to them. In many scientific treatises of today, and in some histo- 
rical expositions of contemporary physics, EINSTEIN’s role is astonishingly 
under-rated. In our over-technicized era most scientists become proficient in 
certain techniques and value technical achievements more than anything else. 
Achievements of a sweeping nature, obtained by ingenious intuition rather 
than by systematic drudgery, are suspected as vague and philosophical rather 
than strictly scientific. But fortunately for EINSTEIN’s personal fate, the 
greatest minds of European physics, particularly PLANCK, VON LAUE and
SOMMERFELD in Germany, later RUTHERFORD and EDDINGTON in England,
LANGEVIN and DE BROGLIE in France, LORENTZ in Holland, recognized the
extraordinary stature of EINSTEIN’s genius and gave him credit where credit 
was due. The commemorative volume published on the occasion of his
seventieth birthday in 1949 [44] (1) contained articles from an international
galaxy of physicists and the number of dissenters was negligible. 


1. — Thermodynamics was a subject which fascinated EINSTEIN particularly,
because of its universal validity. Thermodynamics is a science which
permits us to establish relations between the various physical and chemical 
properties of matter on the basis of a few very general principles, without 
going into the specific mechanisms which are the real originators of these re- 
lations. This is made possible by the very large number of essentially identical 
material particles which cooperate in a statistical way to bring about a certain 
macroscopically measurable effect. The pyramid is there, while the millions 
of slaves whose painstaking efforts collaborated in erecting the pyramid, remain 
invisible. Around the beginning of this century, in the student days of the 
young EINSTEIN, many physicists doubted the existence of atoms which they 
considered a figment of scientific imagination, in spite of the great results of 
the kinetic theory of gases, developed by MAXWELL and later by BOLTZMANN 
and GIBBS. The methods of statistical mechanics, which dealt with the
statistical interaction of a huge number of submicroscopic particles, established
a perfect counterpart to classical thermodynamics by showing the atomistic- 
kinetic background of the thermodynamical principles. But a direct physical 
proof of the existence of atoms was not available yet. In his first thermo- 
dynamical investigations EINSTEIN rediscovered certain fundamental relations 
which were given earlier by BOLTZMANN and GIBBS [1, 2, 3]. He was not 
satisfied, however, with the microscopic justification of the macroscopic picture 
but tried to come to results which would bring the kinetic action of the mole- 
cules in direct evidence. In the memorable year 1905, which saw the birth 
of three of EINSTEIN’s great discoveries,—the Brownian motion, the light 
quanta hypothesis and special relativity—he succeeded with his plans [5]. 
He applied the statistical theory to a particle suspended in a fluid and found 
that the irregular heat-action of colliding molecules is the source of a diffusion
equation with the diffusion constant

$$ D = \frac{RT}{N}\,\frac{1}{6\pi\eta a} \qquad (\eta = \text{viscosity coefficient},\ a = \text{radius of particle}). $$


(1) Cf. also: In Commemoration of the Seventieth Birthday of Albert Einstein, March 14,
1949, in Rev. Mod. Phys., 21, 343-540 (1949).

The result of this diffusion equation is that the root-mean-square displacement
of the suspended particle will not be zero but a quantity which increases with the
square root of the time $t$ (for a time $t$ which is not too small):

$$ \lambda_x = \sqrt{\overline{x^2}} = \sqrt{2Dt}. $$
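
The two formulas above can be tried out numerically. The following is only a minimal sketch under stated assumptions: the function names are hypothetical, and the particle radius, viscosity and temperature are illustrative values (roughly a half-micron sphere in water at room temperature), not data taken from Einstein's paper.

```python
import math

R = 8.314462618      # gas constant, J/(mol K)
N = 6.02214076e23    # Avogadro's number, 1/mol


def diffusion_constant(T, eta, a):
    """D = (RT/N) / (6 pi eta a) for a sphere of radius a in a fluid of viscosity eta."""
    return (R * T / N) / (6.0 * math.pi * eta * a)


def rms_displacement(D, t):
    """Root-mean-square displacement along one axis after the time t: sqrt(2 D t)."""
    return math.sqrt(2.0 * D * t)


# Assumed illustrative values: T in kelvin, eta in Pa s, a in metres.
D = diffusion_constant(T=293.0, eta=1.0e-3, a=0.5e-6)
print(f"D = {D:.3e} m^2/s")
for t in (1.0, 4.0, 16.0):
    print(f"t = {t:5.1f} s   rms displacement = {rms_displacement(D, t):.3e} m")
```

Quadrupling the observation time only doubles the displacement, which is the square-root law that the measurements on suspended particles were able to test.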


Actual measurements of the peculiar trembling motion of suspended part- 
icles—called the Brownian motion—fully corroborated Einstein’s equation and 
palpably demonstrated the existence of atoms and molecules, converting to 
the atomic hypothesis all the physicists who previously took a sceptical stand. 
But EINSTEIN realized that more is at stake. It was the investigation of 
Planck’s radiation law which stimulated him to the highest flights of his sta- 
tistical discoveries. He conceived the idea that instead of using the micro- 
scopic picture as an explanation of the workings of thermodynamics, one 
might reverse the procedure and draw from the macroscopic thermodynamical
relations conclusions concerning the structure of the microscopic mechanism. 
PLANCK deduced his radiation law from the assumption that energy is not 
emitted in continuous quantities but in packages which are multiples of $h\nu$.
It seemed unnecessary to make a more detailed picture of the radiation
mechanism; in particular, one could assume that the Maxwellian equations hold in
vacuum and the quantum laws come into operation only in the interaction 
between matter and radiation. But EINSTEIN inquired into the forces which 
act on a material particle suspended in a field of radiation and found that a 
thermodynamical equilibrium between matter and radiation could never come 
about if the Maxwellian equations held exactly. He evaluated the
energy fluctuations which must exist in a small volume of the radiation field. 
These energy fluctuations are of fundamental importance since it is their effect 
on the material particle which makes the equilibrium between radiation and 
matter possible. The average value of the energy fluctuations in the frequency 
range $(\nu, \nu + d\nu)$ could be calculated on the basis of the following
thermodynamical relation [10]:

$$ \overline{\varepsilon^2} = kT^2 \, \frac{d\bar{E}}{dT}. $$


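Planck’s energy law gave the result (with $\bar{E}$ the mean radiation energy
contained in the volume $V$ within the frequency range considered)

$$ \overline{\varepsilon^2} = h\nu \bar{E} + \frac{c^3 \bar{E}^2}{8\pi \nu^2 \, d\nu \, V}. $$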

The second term is the only one which could be explained on the basis of 
Maxwell's equations. But its contribution in the realm of high frequencies
is by far too small to convey the proper statistical energy to the suspended 
material particle. 
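
The relative size of the two terms can be checked with a short numerical sketch. It assumes the usual Planck expression for the mean energy in a volume $V$ and a frequency band $d\nu$, and compares the sum of the two terms with $kT^2\,d\bar{E}/dT$ obtained by a finite difference. All function names and the numerical values of $V$, $T$, $\nu$ and $d\nu$ are illustrative assumptions.

```python
import math

h = 6.62607015e-34   # Planck constant, J s
k = 1.380649e-23     # Boltzmann constant, J/K
c = 2.99792458e8     # speed of light, m/s


def mode_count(nu, dnu, V):
    """Number of radiation modes in the volume V between nu and nu + dnu."""
    return 8.0 * math.pi * nu**2 * dnu * V / c**3


def mean_energy(nu, dnu, V, T):
    """Planck's mean radiation energy in V within (nu, nu + dnu)."""
    return mode_count(nu, dnu, V) * h * nu / math.expm1(h * nu / (k * T))


def fluctuation_terms(nu, dnu, V, T):
    """Return (h nu E, c^3 E^2 / (8 pi nu^2 dnu V)): the particle and the wave term."""
    E = mean_energy(nu, dnu, V, T)
    return h * nu * E, E**2 / mode_count(nu, dnu, V)


def fluctuation_from_thermodynamics(nu, dnu, V, T, rel_step=1e-4):
    """k T^2 dE/dT, with the derivative taken by a central difference."""
    dT = rel_step * T
    dE = mean_energy(nu, dnu, V, T + dT) - mean_energy(nu, dnu, V, T - dT)
    return k * T**2 * dE / (2.0 * dT)


# Assumed illustrative values: 1 cm^3 of radiation at 300 K, band width 1 GHz.
V, T, dnu = 1.0e-6, 300.0, 1.0e9
for nu in (1.0e11, 1.0e14):
    particle, wave = fluctuation_terms(nu, dnu, V, T)
    total = fluctuation_from_thermodynamics(nu, dnu, V, T)
    print(f"nu = {nu:.0e} Hz  particle/wave = {particle / wave:.3e}  "
          f"(particle + wave)/total = {(particle + wave) / total:.6f}")
```

With these expressions the ratio of the first to the second term is $e^{h\nu/kT} - 1$, so the wave term dominates at low frequencies and the corpuscular term at high frequencies.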

The first term had exactly the same form as the energy fluctuation of a 
gas with particles of the energy $h\nu$. This resulted in the amazing picture of
a «light quantum» (now called «photon»), as a small corpuscle endowed with
the energy $h\nu$ and moving with light velocity $c$. How this corpuscular picture
could be harmonized with the wave picture of radiation was (and essentially
remained) an unsolved riddle, but EINSTEIN had the boldness of mind to
recognize that the hypothesis has to be made, in spite of its strangeness [4].
He immediately applied it to a theoretical explanation of the photo-electric 
effect, and later to the high-frequency limit of the continuous spectrum emitted 
by a fast electron when it is captured. 

EINSTEIN’s next problem was whether the absorption and re-emission of
light energy by an oscillator could be conceived as a purely scalar effect, com- 
parable to the emission of a spherical wave, or whether it had directional 
qualities. He could show convincingly that both the emission and absorption 
of light must be conceived as a directional effect by forming the picture of 
a light quantum as a corpuscle which carries the momentum $h\nu/c$ in the
direction of its propagation.

In the meantime he made the first fundamental advance in the field of 
specific heat by applying Planck’s energy law to the elastic oscillations which 
are present in a solid body. EINSTEIN simplified the picture by assuming one 
single mode of vibration only, yet he obtained a dependency of the specific
heat on the temperature which was correct in its main character and came
close to the physical data in the realm of very low temperatures [8] (2).
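
The single-frequency picture can be illustrated by a small sketch. It assumes the standard textbook form of Einstein's result, $C = 3R\,x^2 e^x/(e^x - 1)^2$ with $x = \theta/T$ and $\theta = h\nu/k$; this formula is not written out in the text above, and the function name and the value of $\theta$ are hypothetical.

```python
import math

R = 8.314462618  # gas constant, J/(mol K)


def einstein_heat_capacity(T, theta):
    """Molar heat capacity of a solid whose atoms all vibrate with one frequency.

    theta = h nu / k is the characteristic temperature of that frequency.
    x^2 e^x / (e^x - 1)^2 is written as x^2 / ((e^x - 1)(1 - e^-x)).
    """
    x = theta / T
    return 3.0 * R * x * x / (math.expm1(x) * -math.expm1(-x))


theta = 300.0  # assumed characteristic temperature in kelvin, for illustration only
for T in (10.0, 50.0, 100.0, 300.0, 1000.0):
    print(f"T = {T:7.1f} K   C = {einstein_heat_capacity(T, theta):8.4f} J/(mol K)")
```

At high temperatures the value approaches the classical $3R$, while it falls toward zero as the temperature goes to zero, which is the main character of the observed behavior.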

A very beautiful contribution to Planck’s radiation law was obtained by
EINSTEIN when under the impact of Bohr’s theory he pondered on an interaction
mechanism between matter and radiation which is in harmony with
the general features of Bohr’s atom model [20]. He utilized only the most
general properties of Bohr’s theory, in particular the assumption that a
particle can possess only certain discrete energy levels $\varepsilon_n$. Between two
energy levels a certain transition probability $A_m^n$ is assumed by EINSTEIN, i.e. a
certain percentage of molecules (per unit time) can spontaneously change
over from $\varepsilon_m$ to $\varepsilon_n$ if $\varepsilon_m > \varepsilon_n$, emitting the energy
$\varepsilon_m - \varepsilon_n$. On the other hand
a certain transition probability $B_n^m$ exists in the opposite direction, absorbing
the corresponding $\varepsilon_m - \varepsilon_n$. This transition requires, of course, the presence
of a certain radiation density $\rho(\nu, T)$; the percentage of particles which
participates in the transition will be proportional to $B_n^m \rho$. But the opposite
process, «induced emission» (in addition to spontaneous emission), can take place
too and will be proportional to $B_m^n \rho$. Finally we need the statistical weights
$p_m$ and $p_n$ of the two energy states $\varepsilon_m$ and $\varepsilon_n$.


(2) The summation procedure over all the elastic vibrations was later carried out
by P. DEBYE (Ann. der Phys., 39, 789 (1912)) and independently by M. BORN and
TH. VON KÁRMÁN (Phys. Zeits., 13, 297 (1912)).
Now the condition of radiation equilibrium gives

$$ p_n \exp\left(-\frac{\varepsilon_n}{kT}\right) B_n^m \, \rho = p_m \exp\left(-\frac{\varepsilon_m}{kT}\right) \left(B_m^n \, \rho + A_m^n\right), $$

to which we can add the relation

$$ p_n B_n^m = p_m B_m^n, $$

which follows from the natural assumption that the radiation density $\rho(\nu, T)$
should go to infinity as $T$ goes to infinity. From these two fundamental
relations we obtain

$$ \rho = \frac{A_m^n / B_m^n}{\exp\left[(\varepsilon_m - \varepsilon_n)/kT\right] - 1}. $$

On the other hand, we know from general thermodynamics (independently
of any specific exchange mechanism between radiation and matter) that Wien’s
displacement law must hold as to the general structure of $\rho(\nu, T)$:

$$ \rho(\nu, T) = \nu^3 f\left(\frac{\nu}{T}\right). $$

This immediately gives as the only possibilities:

$$ A_m^n = \alpha \, \nu^3 B_m^n $$

and

$$ \varepsilon_m - \varepsilon_n = h\nu $$

with the two constants $\alpha$ and $h$. We have thus obtained from very general
considerations Planck’s radiation law, plus the fundamental Bohr condition
which determines the emitted (or absorbed) frequency between two energy
states.
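
The balance expressed by these relations can be verified numerically. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, the value $\alpha = 8\pi h/c^3$ is taken from the familiar form of Planck’s law (it is not derived in the text), and the frequency, temperature, statistical weights and the coefficient $B_n^m$ are arbitrary.

```python
import math

h = 6.62607015e-34   # Planck constant, J s
k = 1.380649e-23     # Boltzmann constant, J/K
c = 2.99792458e8     # speed of light, m/s

ALPHA = 8.0 * math.pi * h / c**3   # assumed value of the constant alpha


def planck_density(nu, T):
    """rho(nu, T) = alpha nu^3 / (exp(h nu / kT) - 1)."""
    return ALPHA * nu**3 / math.expm1(h * nu / (k * T))


def transition_rates(nu, T, B_up=1.0, p_n=1.0, p_m=1.0):
    """Rates n -> m (absorption) and m -> n (induced plus spontaneous emission).

    Level m lies above level n by h nu; Boltzmann factors are taken relative
    to the lower level. B_up plays the role of B_n^m.
    """
    B_down = p_n * B_up / p_m          # B_m^n from p_n B_n^m = p_m B_m^n
    A = ALPHA * nu**3 * B_down         # A_m^n = alpha nu^3 B_m^n
    rho = planck_density(nu, T)
    up = p_n * B_up * rho
    down = p_m * math.exp(-h * nu / (k * T)) * (B_down * rho + A)
    return up, down


up, down = transition_rates(nu=5.0e14, T=3000.0, B_up=2.0, p_n=1.0, p_m=3.0)
print(f"absorption rate = {up:.6e}   emission rate = {down:.6e}")
```

The two rates agree only when the radiation density has the Planck form; replacing `planck_density` by any other function of $\nu$ and $T$ destroys the equilibrium.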

The last ingenious contribution of EINSTEIN to statistical thermodynamics 
was instigated by an idea of the Indian physicist S. N. Bose who demons- 
trated (*) how Planck’s radiation law could be obtained as a consequence of 
a corpuscular statistics in which the light quantum particles are considered 
as indistinguishable from each other. EINSTEIN recognized the analogy bet- 
ween an ideal monoatomic gas and a radiation field [23, 24]. He showed how 
the classical thermodynamics of an ideal gas corresponded to a radiation law
of the Wien type which violated the Nernst theorem. The new statistics corrected
this behavior to a thermodynamical law which corresponded to Planck’s
law and which removed the contradiction to the Nernst theorem [24]. The
«gas degeneracy » thus discovered by EINSTEIN became fundamental for the
understanding of the statistical properties of free electrons inside of a
conducting solid body.

(*) Zeits. f. Phys., 26, 178 (1924); 27, 384 (1924).

Somewhat aside from these statistical considerations and yet of great historical
significance was EINSTEIN’s formulation of the Sommerfeld-Wilson quantum
conditions in terms of Hamilton’s S-function [19]. The accidental feature
of the quantum conditions for separable systems was thus replaced by an 
invariant formulation which could be applied to non-separable systems too. 
Einstein’s general formulation of the quantum conditions in the form of one
unified and invariant principle was interpreted by DE BROGLIE as a resonance 
condition for waves. Hence the DE BROGLIE matter waves take their origin 
from Einstein’s invariant quantum condition. De Broglie’s matter waves, on 
the other hand, culminated in Schrödinger’s wave-mechanics. Hence we see
the direct historical sequence: EINSTEIN-DE BROGLIE-SCHRÖDINGER.


2. — The discovery of analytical geometry by DESCARTES was a landmark 
in the history of mathematical thought. It made it possible to reduce geo- 
metry to algebra and solve all problems of geometry by purely symbolic tools. 
The «coordinates » are a pair of numbers $x, y$ (in space a triplet of numbers
$x, y, z$) which are coordinated to a point P and which can replace the point P
for all operational purposes. 

A student of analytical geometry quickly masters the notion of Cartesian 
coordinates and finds nothing mysterious in them. EINSTEIN wondered about 
many things that other people took for granted and this was his strength. 
He was asked once how it happened that he and not somebody else discovered 
relativity. His answer was that other people wondered about the properties
of space in their early youth but in their mature years saw no problems any more.
He, however, came in touch with the problems of space at a much later age 
and therefore could ask questions that other people overlooked [47, p. 84]. 

The question of coordinates disturbed EINSTEIN since his early youth. 
Coordinates are a wonderful tool in the description of physical events. But
why is it that the role of these coordinates is so very different in pure geo- 
metry and in physics? In order to define our coordinates we first have to 
establish a definite set of axes to which these coordinates are related. These
axes do not belong to the given geometrical problem but can be chosen in 
any orientation we like. Any time we want we can abandon one set of co- 
ordinates in favor of another set. Each set of coordinates establishes a definite 
«reference system » but all these reference systems have equal validity since 
space knows no particular center and no preferential directions. No reference 
system can be designated as an « absolute system »; on the contrary, all frames
of reference are equivalent relative to each other and of equal a priori validity
for the description of geometrical relations (notwithstanding the fact that
in a special problem we may choose our coordinate axes in a way which is
particularly suited to the symmetry properties of the given problem).

In physics the situation is very different. We may observe the very same
event from many different reference systems but the description will by no
means be equivalent. For example, we observe a stone falling to the earth
and we describe its motion as one of uniform acceleration. But let us assume
that another observer is in a frame of reference which falls with the stone.
He will say that the stone is constantly at rest. Its velocity does not increase
uniformly with the time $t$ but is permanently zero.

NEWTON was aware of the difficulty but his opinion was that we have no 
right to quarrel with creation on philosophical grounds. He assumed as a 
matter of fact the existence of a certain «absolute frame of reference » in 
which we have to describe the laws of nature. If we depart from this frame,
the fundamental laws of nature will not come out properly. 

Actually the absoluteness of the frame of reference was not so extreme in 
Newton’s physics. If we are in a moving train and that train moves on abso- 
lutely smooth tracks and with absolutely constant velocity, there is no instru- 
ment in the world by which the velocity of the train as such could be measured. 
If the curtains are drawn and we make no comparisons with the outer world, 
everything will happen in the moving train exactly as if it were at rest. Hence
all reference systems which move with constant velocity relative to each other
are equally justified for the description of the laws of nature. The equivalence 
of reference systems is violated only if accelerated systems are taken into 
account. 

In NEWTON's time the optical and electromagnetic phenomena were not
yet properly investigated. When during the nineteenth century FARADAY and 
MAXWELL established the field theory of electromagnetism and the identity 
of optical and electromagnetic phenomena was demonstrated, the problem of
the absolute reference system reappeared to an even more menacing degree.
In the realm of optical and electromagnetic phenomena an absolute reference
system had to be assumed; the previous leeway of reference systems moving
relative to each other with constant velocity had to be dropped. Indeed, light
is a wave motion and these waves propagate in every direction with the same
constant velocity c. But if the observer moves relative to the light waves,
he disturbs the homogeneity of light propagation in various directions and
this must lead to observable effects by which his motion relative to the absolute
reference system (also called « aether ») may be detected. Yet all the manifold
efforts to demonstrate these effects failed. The famous Michelson-Morley
experiment of 1887 was the first of a long line of distinguished experiments
which used indicators of extreme accuracy for the demonstration of the
absolute motion of the earth through the aether but which failed to detect 
any trace of such motion. Everything occurred exactly as if the
earth were at rest, in accordance with that relativity principle which in the
realm of purely mechanical phenomena had been well established since NEWTON’s
time. 
The tremendous puzzle posed by these negative results led to several 
attempts at a theoretical explanation, before EINSTEIN entered the scene.
G. F. FITZGERALD and H. A. LORENTZ conceived independently the idea that
the length of a rod is influenced by its absolute motion in such a way that 
the ellipsoidal wave measured out with the help of the moving rod appears 
spherical. The weakness of all these explanations was that they could not 
make it plausible why nature is constructed in such a way that it prevents 
us by all means from discovering the absolute motion through the aether. 
EINSTEIN approached the problem from a fundamental angle [6]. He con- 
ceived the result of the Michelson-Morley experiment and all related expe- 
riments as a clear demonstration of the fact that all frames of reference which 
move relative to each other with uniform velocity, must be equivalent for 
the description of natural events. Hence the principle of relativity, previously
established for the realm of mechanical phenomena only, must be extended 
to all phenomena of nature. But then the question has to be investigated, 
what relations must exist between various equivalent reference systems. Let 
2, y, <, t be the coordinates used by one observer and a’, y’, 2‘, t' the coordi- 
nates used by another observer who is in relative motion to the previous 
observer. It was always our custom to put intuitively 


$$t' = t,$$


i.e. to assume that time is an absolute, universal variable which has validity
for the entire universe. It turns out, however, that this cannot be so; we
must insist that the homogeneous propagation of light in every direction is 
equally valid for all observers who are in equivalent reference systems. This 
has the consequence that the equation of uniform light propagation 


$$x^2 + y^2 + z^2 - c^2 t^2 = 0$$


must remain valid in the new reference system, if it was valid in the original 
system. This leads to the relation 


$$x'^2 + y'^2 + z'^2 - c^2 t'^2 = k\,(x^2 + y^2 + z^2 - c^2 t^2).$$


Although k is not necessarily 1, yet the natural demand that the transformation
formulae shall be linear, together with the condition that the transformation
shall be reversible without any preference for the one or the other
system, permits us to normalize k to 1. Thus we obtain the principle that
the relations between equivalent reference systems are those linear relations
between the coordinates which leave the quadratic form


$$x^2 + y^2 + z^2 - c^2 t^2$$


invariant. Such transformations occurred earlier in consequence of certain 
mathematical properties of the Maxwellian equations and were investigated 
by LORENTZ and by POINCARÉ. But it was EINSTEIN who discovered the
proper interpretation of these transformations as relations between coordinates 
which belong to equivalent frames of reference. Thus it was shown that time 
has no absolute significance but is merely a fourth coordinate, to be added 
to the three space coordinates. 
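
The invariance of the quadratic form can be checked numerically. The following is a minimal sketch, assuming the special case of uniform relative motion along the x-axis and units in which c = 1; the function names `boost_x` and `quadratic_form` are illustrative and do not come from the original text.

```python
import math

def boost_x(x, y, z, t, u, c=1.0):
    """Lorentz transformation to a frame moving with velocity u along the x-axis."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (x - u * t), y, z, gamma * (t - u * x / c**2)

def quadratic_form(x, y, z, t, c=1.0):
    """The quadratic form x^2 + y^2 + z^2 - c^2 t^2 discussed in the text."""
    return x**2 + y**2 + z**2 - (c * t) ** 2

if __name__ == "__main__":
    event = (3.0, 1.0, 2.0, 5.0)    # arbitrary event (x, y, z, t)
    moved = boost_x(*event, u=0.6)  # the same event seen from the moving frame
    print(quadratic_form(*event), quadratic_form(*moved))
```

Both printed values agree up to rounding, and an event on the light cone (form equal to zero) stays on the light cone in the new frame, which is the condition of uniform light propagation stated above.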

EINSTEIN drew all the fundamental physical consequences from his discovery. 
The Newtonian equations of motion, and in fact all equations of physics had 
to be revised, in order to bring them in harmony with the relativity principle. 
If a certain equation was given in a certain frame of reference—e.g. the frame 
in which the particle was in momentary rest—one could immediately trans- 
form it in a unique way to any other equivalent frame. The new equations 
of motion showed a definite dependence of the mass m on the velocity, ac- 
cording to the law 


$$m = \frac{m_0}{\sqrt{1 - v^2/c^2}}.$$

The mass became infinite for light velocity which was the highest attainable
velocity under any circumstances since even an arbitrarily large force acting 
for an arbitrarily long time could never accelerate a particle beyond light 
velocity. Measurements on fast particles corroborated the consequences of 
relativistic mechanics beyond any shade of doubt. 

If an observer B had the velocity u relative to an observer A and an event
occurred relative to B with the velocity v, the velocity w of this event relative
to A was not u + v but

$$w = \frac{u + v}{1 + uv/c^2}.$$

This is EINSTEIN’s famous addition theorem of velocities. We can see from it
that if v is light velocity, w remains light velocity, no matter how large u
may be. In relativity theory light velocity plays the same role as an infinite
velocity plays in ordinary mechanics.
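
The addition theorem is easy to try out. The sketch below assumes collinear velocities and units in which c = 1; the name `add_velocities` is hypothetical.

```python
def add_velocities(u, v, c=1.0):
    """Relativistic composition of two collinear velocities u and v."""
    return (u + v) / (1.0 + u * v / c**2)

if __name__ == "__main__":
    print(add_velocities(0.5, 0.5))  # less than the naive sum u + v = 1
    print(add_velocities(0.5, 1.0))  # light velocity stays light velocity
    print(add_velocities(0.9, 0.9))  # still below c
```

However close u and v come to c, the composed velocity never exceeds c, and composing any u with c returns c again.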

In the year 1907 [9] EINSTEIN drew the most important consequence of
his theory, viz. the equivalence of mass and energy:

$$E = m_0 c^2.$$

It expresses the principle that every mass by its very existence is the seat
of a tremendous amount of energy which under ordinary circumstances remains
dormant, due to the high stability of $m_0$. In certain nuclear processes,
however, involving fission or fusion, a transformation of mass into radiation
takes place which means the release of an enormous amount of energy. The
terrible destructive power of the atom bomb, which converted only a few
ounces of matter into heat, creating an artificial sun of several billion degrees
centigrade, verified Einstein’s equation in a tragic way.
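
To give a feeling for the orders of magnitude involved, the following sketch evaluates the rest energy of a small mass. It assumes SI units and the avoirdupois ounce; the constant `OUNCE_KG` and the function `rest_energy` are illustrative names, not part of the original text.

```python
C = 299_792_458.0          # speed of light in m/s (exact SI value)
OUNCE_KG = 0.028349523125  # one avoirdupois ounce in kg (assumed unit)

def rest_energy(mass_kg):
    """Energy in joules equivalent to a rest mass, E = m0 * c**2."""
    return mass_kg * C**2

if __name__ == "__main__":
    for ounces in (1, 2, 3):
        joules = rest_energy(ounces * OUNCE_KG)
        print(f"{ounces} oz -> {joules:.3e} J")
```

A single gram already corresponds to roughly 9 x 10^13 joules, which is why the conversion of a few ounces suffices for the effects described above.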

The converse effect, viz. that every energy has an equivalent mass value,
was deduced by EINSTEIN still in 1905 [7]. He showed that a body which
receives the energy E in any form whatsoever, must increase its mass according
to the equation

$$\Delta m = \frac{E}{c^2}.$$

In 1908 HERMANN MINKOWSKI made a very fundamental mathematical
discovery by giving a metrical interpretation of Einstein’s theory (4). He
showed that the reason for the constancy of light velocity is not a physical
but a geometrical fact. It is a direct consequence of the circumstance that
the world of physics has a metrical structure which includes space and time
into one unified manifold. The square of the distance element of this manifold
is not the ordinary Euclidean expression $dx^2 + dy^2 + dz^2$ but the extended
expression

$$ds^2 = dx^2 + dy^2 + dz^2 - c^2\,dt^2.$$


The equations of physics are such that they comply with the homogeneity 
of a four-dimensional instead of a three-dimensional manifold. Space and time 
merge into one united manifold, the role of time being an additional fourth 
dimension. The equivalent reference systems of EINSTEIN are in geometrical 
interpretation various orientations of the fundamental set of four axes, x, y, z,
ict, which can be freely rotated in the four-dimensional world. The inertial
systems of Einstein appear in geometrical formulation as the group of four- 
dimensional rotations which leave the fundamental line-element invariant. 

This was a constructive new principle of fundamental importance which
served EINSTEIN as a point of departure for his later investigations. He freely
admitted his indebtedness to MINKOWSKI’s geometrical discovery and thus the
claim that he did not give sufficient credit to MINKOWSKI’s work is hardly
justified. EINSTEIN was not a mathematician, of course, and it took him some
time before he recognized the fundamental advance achieved by the new
geometrical interpretation. But after that he plunged whole-heartedly
into a further exploration of the new geometry.

(4) Math. Annalen, 68, 472 (1910).

One of the important moves of MINKOWSKI, which played an important
part in the later development of the theory, was the construction of the so-called
« matter tensor » $T_{ik}$. The theory of relativity united the conservation
laws of momentum and energy into the four components of one single vector
equation. The question arose, however, how these conservation laws should
be formulated if energy (or mass) was not lumped in single particles but
distributed continuously over the field. The Maxwellian electromagnetic field
gave a model for such a distribution and also the answer to the problem. The
Maxwellian stress-components $t_{ik} = t_{ki}$ ($i, k = 1, 2, 3$) formed a symmetric
tensor of second order. They had to be extended to a symmetric tensor of
four dimensions. Hence a fourth row $t_{4i}$ and a fourth column $t_{i4}$ had to be
added. In physical interpretation $t_{44}$ had the significance of energy density,
while the three components $t_{i4}$ ($i = 1, 2, 3$) represented the momentum density.
The three components $t_{4i}$ ($i = 1, 2, 3$), on the other hand, represented the
energy flux. Already in 1900 POINCARÉ postulated that every energy flux
must be associated with a corresponding momentum density. Now the equality
of energy flux and momentum density became the consequence of the symmetry
of the matter tensor.

Quite generally, momentum and energy distributed over a field had to be
conceived as the components of a symmetric tensor of second order $T_{ik} = T_{ki}$.
The conservation of momentum and energy was expressed by the mathematical
equation that the divergence of this tensor vanished:

$$\frac{\partial T_{ik}}{\partial x_k} = 0.$$

3. Relentlessly the demon drove EINSTEIN to newer and newer efforts. The
principle of relativity established the equivalence of reference systems, but so
far on a very restricted scale only. Why should uniform motions be preferred?
Why not all motions? It should be possible to formulate the laws of nature
in such a way that no reference system would be a priori distinguished. The
fundamental laws of nature should be equally describable in all reference
systems. This was the principle of general relativity which offered a tremendous
challenge to EINSTEIN, but he knew that he would find no peace before he arrived
at the solution of this problem. And here started that dogged up-hill fight
of EINSTEIN which lasted for ten years and finally came to a victorious end
in 1915. The history of this period is an unprecedented chapter in the eternal
struggle of the human intellect toward light and freedom.

Two fortunate circumstances came together. The one was that EINSTEIN
had an intimate friend in the person of the Swiss mathematician MARCEL GROSSMANN,
a mind of quick and flexible perception whom EINSTEIN could consult
in all questions of mathematical technique. The other was that EINSTEIN
himself was no mathematician. Thus the danger of trivializing the problem
by accepting a purely formal solution (any mathematical statement can be
formulated in any arbitrary reference system) was avoided.

Already early in the game the phenomenon of gravity came into the focus
of EINSTEIN’s interest. It was natural that, going beyond the group of reference
systems of uniform motion, he first chose as simplest example the reference
systems which were in uniform acceleration. And now he noticed that the
introduction of such a reference system generated an apparent field of gravity.
EINSTEIN’s famous « equivalence principle » [9] expressed the following thought.
A physicist makes measurements in a closed laboratory where a uniform field
of gravity exists, due to the gravitational attraction of the earth. Suddenly
the earth is blasted away and the field of gravity destroyed. At this moment
a giant begins to pull the entire laboratory upwards with the uniform acceleration
g. Then the physicist will not notice anything of the outside happenings.
Everything will occur in his laboratory exactly as it occurred before
the blast. This thought experiment put an important clue into EINSTEIN’s
hands. In the apparent gravitational field generated by the pull of the giant
it is entirely natural that all bodies fall toward the earth with the same acceleration
and that the force which they exert on the table on account of their
weight is strictly proportional to their inertial mass. Hence the astonishing
fact, demonstrated to a very high degree of accuracy by the gravimetrical
measurements of the Hungarian physicist EÖTVÖS, that heavy mass
and inertial mass are strictly proportional, became a natural consequence of
the equivalence hypothesis.

EINSTEIN noticed that the Minkowskian line-element which expressed the
uniform propagation of electromagnetic waves with the velocity c in every
direction, changed its form considerably if an accelerated system was introduced.
But even in the most arbitrary systems, which introduced motions of
a most outlandish character, provided that the principle of continuity was
preserved, the line-element still maintained a very definite character; it always
remained a quadratic differential form of the variables, although the coefficients
were no longer constants but became some more or less complicated
functions of the coordinates:

$$ds^2 = g_{ik}\, dx_i\, dx_k.$$

If we write a formula in this form, we make use of « Einstein’s sum convention »
which helped so tremendously in taking the drudgery out of the formulae
of absolute calculus. The sum convention is that we should automatically
sum over two equal indices, without indicating the summation specifically
by a Σ-sign.
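
Written out explicitly, the sum convention is nothing but a double loop over the repeated indices. The following sketch makes this concrete; it assumes real coordinates (x, y, z, t) with c = 1, so that the flat Minkowskian coefficients are diag(1, 1, 1, -1), and the name `line_element_squared` is illustrative only.

```python
def line_element_squared(g, dx):
    """Evaluate ds^2 = g_ik dx_i dx_k, summing over both repeated indices."""
    n = len(dx)
    return sum(g[i][k] * dx[i] * dx[k] for i in range(n) for k in range(n))

# Flat Minkowskian coefficients for coordinates (x, y, z, t) with c = 1 (assumed).
MINKOWSKI = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, -1.0],
]

if __name__ == "__main__":
    # A displacement along a light ray: 1 + 4 + 4 - 9 = 0.
    print(line_element_squared(MINKOWSKI, (1.0, 2.0, 2.0, 3.0)))
```

In an arbitrary reference system the entries of `g` would be functions of the coordinates, evaluated at the point in question, but the summation itself is unchanged.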

Now EINSTEIN started to plunge wholeheartedly into the intricacies of
absolute calculus which is the mathematical tool for the investigation of arbitrary
reference systems [12]. This branch of mathematical analysis takes its
point of departure from the ingenious geometrical discoveries of GAUSS and
RIEMANN and found further nourishment in the studies of CHRISTOFFEL,
LIPSCHITZ, BELTRAMI and others. By a fortunate circumstance an elegant
codification of the entire edifice of « tensor calculus » had been given a few years
earlier (in 1900) by the Italian mathematicians RICCI and LEVI-CIVITA (5).
Probing into the possibilities of this calculus he became more and more lost
in a strange land which at first did not show any similarity to the customary
constructions of theoretical physics. But he did not lose courage since
the equivalence hypothesis showed him clearly that the mystery of universal
gravity must have something to do with those $g_{ik}$ quantities which form the
components of the metrical tensor. The metrical tensor! How peculiar that
all equations of special relativity, if they were written down in arbitrary coordinate
systems, displayed prominently the metrical tensor. It looked as
if this metrical tensor were part of the problem, as if it belonged to the unknowns
of the problem. But then, since the $g_{ik}$ are continuous field functions of the
coordinates, we should expect field equations for their determination. How
shall we find these equations? They certainly must have « general covariance »,
i.e. they must be expressible in any arbitrary frame of reference. Hence they
must be constructed according to the rules of absolute calculus. The simplest
covariant differential operator was of the second order and led to the fundamental
curvature tensor of Riemann, $R_{iklm}$. If this tensor was put equal to
zero, one obtained the flat Minkowskian space and that was certainly not
desirable since then all gravitational fields would be reducible to mere apparent
fields and that is certainly impossible. Now the Riemann tensor can
actually not be expected to be suitable for the establishment of field equations
since it overdetermines the problem. It has too many components. In four
dimensions we have 10 independent $g_{ik}$ and 20 independent $R_{iklm}$. We cannot
expect 20 equations for 10 unknowns.
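
The two counts quoted here follow from the standard formulas for the number of independent components in n dimensions: n(n+1)/2 for a symmetric tensor of second order and n²(n²-1)/12 for the Riemann curvature tensor. The sketch below simply evaluates them; the function names are illustrative.

```python
def symmetric_tensor_components(n):
    """Independent components of a symmetric second-order tensor in n dimensions."""
    return n * (n + 1) // 2

def riemann_components(n):
    """Independent components of the Riemann curvature tensor in n dimensions."""
    return n * n * (n * n - 1) // 12

if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, symmetric_tensor_components(n), riemann_components(n))
```

For n = 4 this gives 10 metric components against 20 curvature components, the mismatch described in the text; a symmetric second-order tensor, by contrast, again has exactly 10.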

(5) Math. Annalen, 54, 125 (1900).

EINSTEIN overcame the difficulty by an ingenious device. A simple contraction
reduced the order of the tensor from four to two. He now obtained
the tensor

$$R_{ik}.$$

This tensor had exactly the right degree of determination. It was neither
over-determined nor under-determined. It was a symmetric tensor of second
order, just as the tensor $g_{ik} = g_{ki}$ itself. We will thus get 10 equations for
10 unknowns. And even more miraculously: a slight modification of this
tensor led to a new tensor which had a further fundamental property. The
tensor

$$S_{ik} = R_{ik} - \tfrac{1}{2} R\, g_{ik}$$


was not only a symmetric tensor of second order but its divergence was automatically
zero, expressing a conservation law of the same kind as the conservation
law of momentum and energy in field-theoretical formulation. Hence
a tensor was constructed in a very natural way out of the fundamental elements
of Riemannian geometry which had exactly the same properties as the
« matter tensor » of physics, encountered earlier in MINKOWSKI’s work. EINSTEIN
could thus establish the relation

$$S_{ik} = \kappa\, T_{ik},$$

where $\kappa$ is a constant whose value depends on the units in which lengths and
weights are measured. Its relation to Newton’s gravitational constant k came
out as follows:

$$\kappa = \frac{8\pi k}{c^2}.$$


This equation represents a triumph of speculative-constructive thinking 
which is unparalleled in the entire history of science. If special relativity 
(Einstein’s theory of 1905, so called in distinction to his theory of 1915) united 
space and time into one inseparable entity, general relativity now added physical 
matter to the realm of geometry by showing how matter can be conceived 
in terms of curvature. The fundamental equation of Newtonian potential 

$$\Delta\varphi = -4\pi\varrho,$$


which connected the gravitational potential with the density of matter, was 
enlarged to a much more comprehensive scheme of ten equations in which 
the scalar density of matter was replaced by the ten components of the matter 
tensor and the scalar Newtonian potential by the ten components of the 
metrical tensor. And yet, when EINSTEIN investigated these equations under 
the simplified conditions which hold if matter is very weak and distributed 
in spherically symmetric fashion, he found that the theory leads to exactly 
the same conclusions as the much simpler scalar Newtonian theory. The entire 
Newtonian theory, based on the two pillars: 


1) Mass times acceleration equals the moving force. 


2) The moving force between two masses is proportional to the product 
of the two masses and inversely proportional to the square of their 
mutual distance, 


was now deducible from mere speculation, including the strict proportionality 
of inertial and gravitating mass which was never explained before [16]. 

The uncanny feature of the new theory was that while before science was 
happy if it succeeded in finding some mathematical law which could describe 
a certain body of observations, now the mathematical law itself became de- 
ducible from purely logical operations. Man in his eternal up-hill fight for 
knowledge has reached a new plateau. 

We have telescoped a dramatic struggle of ten years into the scope of a 
few sentences. This is made possible by the fact that today the entire body 
of knowledge which is needed for Einstein’s theory is common property of 
every graduate student of physics. To obtain the proper perspective for the 
immense achievement we should visualize that at the time when EINSTEIN 
started on his journey, the juggling with curvilinear coordinates—a complete 
routine matter today—was hardly known, the absolute calculus was not studied 
and Riemannian geometry was not much more than a mathematical dream. 
To model this intractable material into a shape in which the outlines of a 
new cosmic order dawned on us and finally a whole symphony of new ideas 
dazzled the eye, required more than human efforts. At the end of his essay: 
« Origins of the general theory of relativity » [45] which gives a brief account 
of his struggles, EINSTEIN finishes the narrative with the following words: 
«In the light of finally obtained knowledge the deductions seem almost self- 
evident and can be understood with no great difficulties by any intelligent 
student. But the foreboding search in the dark, year after year, with its 
intense yearnings, its alternation from confidence to despondence, and then the 
ultimate break-through to final clarity, can only be perceived by somebody 
who experiences it himself ». 

More than once he was lured into impassable cul-de-sacs. Once he was 
very near to the final solution and only a small step separated him from 
victory [13]. But when he tried to solve the field equations under the simplified 
conditions which hold for weak matter [17], he was discouraged by a 
peculiar phenomenon. He expected from the experiences of other field theories 
that the field equations will determine the $g_{ik}$ uniquely if the proper initial 
and boundary conditions are prescribed. But actually an infinity of solutions 
existed, in contradiction to the causality principle of nature. This puzzle 
caused a relapse of more than two years. In fact he thought that from this 
phenomenon the conclusion may be drawn that the field equations of gravity 
cannot be generally covariant [13], i.e. the principle of general relativity has 
to be sacrificed in favor of some restricted class of reference systems. In actual 
fact it is entirely natural that in view of the arbitrariness of reference systems 
the solution cannot be unique as long as we do not specify the reference system 
we are going to use. A contradiction to causality would only come about if 
more than one solution would exist in one and the same reference system. This, 
however, is not the case. « How simple», we may be inclined to say. But 
how absolutely wrong we are if we try to set up rational standards in an 
unexplored country in which every tree, every blade of grass assumes a new 
shape and color and every foreign noise frightens us into panic, although it 
may only be the murmur of a creek. 

After the final break-through EINSTEIN experienced an additional happy 
surprise which filled him with almost ecstatic joy. When he calculated the 
motion of a planet around the sun in second approximation, he found a slight 
deviation from the Newtonian theory in the form of a precession of the peri- 
helion, given by the following expression [14]: 


$$\varepsilon = \frac{6\pi kM}{c^{2} a\,(1 - e^{2})}$$


($M$ = mass of sun, $a$ = semi-major axis, $e$ = eccentricity). This amount, 
evaluated for the planet Mercury, gave exactly that mysterious surplus-precession 
of 43” per century which remained as an inexplicable residual effect 
after taking into account all the perturbations (LEVERRIER, 1856). A few 
years ago G. M. CLEMENCE of the U.S. Naval Observatory subjected the entire 
perihelion problem to a critical re-examination. His calculations led to the 
following result, quoted from the author’s summary (6): « The theoretical relativity 
effect of the motion of Mercury’s perihelion is 43.03” ± 0.03”; the value 
obtained by subtracting all other known effects from the total observed motion 
is 42.56” ± 0.94”. For the earth’s perihelion the corresponding figures are 
3.8” ± 0.0” and 4.6” ± 2.7”. The confirmation by observation of the relativity 
effects is regarded satisfactory for both Mercury and the earth ». 
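
The quoted theoretical figures can be reproduced with a short calculation. The sketch below assumes the standard expression for the relativistic perihelion advance per revolution, 6πkM / (c² a (1 − e²)), and uses modern values for the product kM of the sun and for the orbital elements; these constants and all names in the code are assumptions introduced for illustration and are not taken from the original text.

```python
import math

GM_SUN = 1.32712440018e20   # m^3/s^2, product kM for the sun (assumed modern value)
C = 299_792_458.0           # m/s, speed of light
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def advance_per_orbit(a_m: float, e: float) -> float:
    """Relativistic perihelion advance in radians per revolution."""
    return 6.0 * math.pi * GM_SUN / (C**2 * a_m * (1.0 - e**2))


def advance_arcsec_per_century(a_m: float, e: float, period_days: float) -> float:
    """Accumulated advance over one Julian century, in seconds of arc."""
    orbits_per_century = 36525.0 / period_days
    return advance_per_orbit(a_m, e) * orbits_per_century * ARCSEC_PER_RAD


# Assumed orbital elements: semi-major axis (m), eccentricity, sidereal period (days).
MERCURY = dict(a_m=5.7909e10, e=0.20563, period_days=87.969)
EARTH = dict(a_m=1.49598e11, e=0.0167, period_days=365.256)

if __name__ == "__main__":
    for name, planet in (("Mercury", MERCURY), ("Earth", EARTH)):
        print(f"{name}: {advance_arcsec_per_century(**planet):.2f} arcsec per century")
```

With these inputs the result for Mercury comes out close to 43” and for the earth close to 3.8”, in line with the theoretical values in the quotation.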

Apart from the precession of the perihelion, Einstein’s theory predicted 
two further optical effects. The one was a red-shift of the spectral lines if the 
emitting atom is on the surface of a star of the radius $r$ and the mass $M$: 

$$\frac{\Delta\lambda}{\lambda} = \frac{kM}{c^{2} r}.$$

(6) Rev. Mod. Phys., 19, 361 (1947). 

This red-shift is expressible in the form of $v/c$ where $v$ is an equivalent receding 
velocity which would cause (due to Doppler effect) the same shift of the 
spectral lines as the Einstein gravitational effect. This velocity is on the sun 
only 0.64 km/s. Hence it is difficult to separate this very small effect from other 
Doppler effects, caused by temperature and pressure. It is imperative that 
spectral lines shall be chosen which are singularly sharp and to a large degree 
insensitive to temperature and pressure effects. The originally negative results 
of CH. E. ST. JOHN at the Mt. Wilson Observatory were caused by an improper 
selection of lines. A careful re-examination of the problem gave a complete 
confirmation of the Einsteinian prediction (7). Even more decisive was the 
spectacular result obtained by W. S. ADAMS of the Mt. Wilson Observatory 
on the famous «dark companion » of Sirius (8). Since the distance of Sirius 
is well known, we possess from the luminosity of the star its radius, but we 
also know its mass from its influence on the orbit of Sirius. Hence we can 
calculate the red-shift of the light coming from the dark companion. Since 
the mass is comparable to that of the sun, while the radius is comparable to 
that of the earth, the red-shift becomes magnified by the factor 40, in com- 
parison to the red-shift on the sun. The measurements of ADAMS gave 23 km/s 
as the mean value of the gravitational Doppler effect, compared with the theo- 
retically predicted 26 km/s. 
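
The order of magnitude of the solar value can be checked numerically. The sketch below assumes the standard weak-field expression Δλ/λ = kM / (c² r), so that the equivalent Doppler velocity is v = kM / (c r); the solar constants are assumed modern values, and the star with one solar mass and one fortieth of the solar radius is a purely hypothetical input chosen to illustrate the 1/r scaling, not a measured radius.

```python
GM_SUN = 1.32712440018e20  # m^3/s^2, product kM for the sun (assumed modern value)
R_SUN = 6.957e8            # m, solar radius (assumed modern value)
C = 299_792_458.0          # m/s, speed of light


def redshift_velocity_km_s(gm: float, radius_m: float) -> float:
    """Equivalent receding velocity v = kM / (c r), in km/s."""
    return gm / (C * radius_m) / 1000.0


if __name__ == "__main__":
    v_sun = redshift_velocity_km_s(GM_SUN, R_SUN)
    # Hypothetical compact star: same mass, radius smaller by a factor 40.
    v_compact = redshift_velocity_km_s(GM_SUN, R_SUN / 40.0)
    print(f"sun: {v_sun:.2f} km/s, compact star: {v_compact:.1f} km/s")
```

Because the shift scales as M/r, shrinking the radius at fixed mass magnifies the effect by the same factor.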

The second optical effect predicted by Einstein’s theory is a bending of 
light rays by the sun. The effect is twice as big as the bending of a particle 
by the sun, moving with light velocity and calculated by ordinary mechanics. 
The deviation for a light ray which passes the limb of the sun becomes 

$$\delta = \frac{4kM}{c^{2} r} = 0.847 \cdot 10^{-5} = 1.750''.$$


This amount, observable only at the time of a solar eclipse, is very small and 
taxes the accuracy of the measuring instruments to the utmost. A number 
of astronomical expeditions were organized to test this consequence of Einstein’s 
theory. Two British expeditions, to Sobral and Principe, found in 1919 that the 
statistical reduction of their plates gave for the deflection constant 
1.98” ± 0.12” and 1.61” ± 0.30” respectively. The 1922 expedition of the Lick 
Observatory gave 1.72” ± 0.15”. However, the expedition of the Einstein Tower 
Observatory of Potsdam, Germany, gave in 1929 the value 2.24” ± 0.10” (9) although 
TRUMPLER reduced the same observations by a different statistical method and 
obtained 1.75” ± 0.13” (10). The Yerkes Observatory obtained in 1947 the value 
2.01” ± 0.27”, in 1952 the value 1.70” ± 0.10” (11). The last result is not considered 
reliable, however, because of poor atmospheric conditions. 


(7) Astrophys. Journ., 67, 195 (1928). For a rejective discussion of the Einstein 
effects cf. E. F. FREUNDLICH: Wissensch. Ann., 1, 35 (1952). 

(8) Proc. of the Nat. Acad. of Science, 11, 382 (1925). 

(9) Zeits. f. Astrophysik, 3, 171 (1931). 
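
The predicted deflection at the solar limb, against which these measurements are compared, can be evaluated as follows. The sketch assumes the standard formula 4kM / (c² r) for the relativistic deflection and half that value for a particle treated by ordinary mechanics; the solar constants are assumed modern values and the function name is illustrative.

```python
import math

GM_SUN = 1.32712440018e20  # m^3/s^2, product kM for the sun (assumed modern value)
R_SUN = 6.957e8            # m, solar radius (assumed modern value)
C = 299_792_458.0          # m/s, speed of light
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def limb_deflection_arcsec(factor: float = 4.0) -> float:
    """Deflection of a ray grazing the sun, factor * kM / (c^2 r), in seconds of arc.

    factor=4 is the relativistic value, factor=2 the value from ordinary mechanics.
    """
    return factor * GM_SUN / (C**2 * R_SUN) * ARCSEC_PER_RAD


if __name__ == "__main__":
    print(f"relativistic: {limb_deflection_arcsec(4.0):.3f} arcsec")
    print(f"ordinary mechanics: {limb_deflection_arcsec(2.0):.3f} arcsec")
```

The relativistic value is close to 1.75”, so most of the quoted measurements scatter around or somewhat above the prediction, and all lie well above the value from ordinary mechanics.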

Our solar system was not the only beneficiary of EINSTEIN’s revolutionary 
ideas. The new insights into the geometrical structure of the space-time world 
had radical cosmic consequences as well. If one imagined that the masses 
concentrated in the stars created a certain average mass density in the uni- 
verse, one could see that an average curvature had to be present in the world. 
This average curvature had the consequence that space bent back on itself 
and came to a close in a similar way as a circle comes to a close. Space was 
no longer infinite; it was a boundless but finite space in which a traveller 
eventually returned to his starting point if he continued his journey constantly 
in the same direction. The dimensions of space were huge but not infinite. 
This led to the resolution of certain grave difficulties which are inherent in 
the assumption of an open space. In an open space we are unable to distribute 
any finite amount of matter without setting up a center of the world. More- 
over, the stars would be doomed to gradually dissipate into infinity, with no 
chance of an equilibrium. In a finite universe these difficulties disappear. 
EINSTEIN thought that it should be possible to construct a finite and balanced 
universe in which the stars remain on the average at relative rest to each other. 
For this purpose he enlarged his original field equations by the so-called « cosmological 
term » which for ordinary purposes is excessively small but which becomes 
of decisive importance when it comes to spaces of cosmic dimensions. The 
connection between curvature tensor and matter tensor now became 

$$S_{ik} + \lambda\, g_{ik} = \kappa\, T_{ik},$$

where $\lambda$ is the so-called « cosmological constant » [21]. EINSTEIN assumed a 
pre-established harmony between the average density and the cosmological 
constant, in order to make a static universe possible. It was shown, however, 
by G. LEMAITRE (12) that the Einsteinian universe is not stable since it becomes 
more and more time-dependent under the influence of the slightest 
disturbance. Moreover, the observations of E. P. HUBBLE of the distant nebulae 
showed conclusively that the universe is not static but in a state of 
constant expansion. This opened new avenues of cosmological investigations, 
with contributions from DE SITTER, FRIEDMAN, LEMAITRE, ROBERTSON, MILNE 
and others. EINSTEIN himself was not inclined to cosmological speculations 
of any specific kind. He considered our present knowledge of the role of matter 
in general relativity so vague that any extrapolation into billions of years in 
the past or in the future seemed to him of not more than mythological value. 
He even lost confidence in the cosmological constant when Hubble’s observations 
disproved the hypothesis of a static universe [28]. But the conclusion 
that the physical space has an average curvature and is a closed and finite 
space, remains one of the most cherished cosmological consequences of Einstein’s 
theory. 


(10) Science, 75, 538 (1932). 
(11) G. J. WHITROW: The Observatory, 75, 166 (1955). 
(12) Ann. Soc. Sci. Bruxelles, 47, 49 (1927). 


4. — The years between 1905 and 1925 were the heroic years in EINSTEIN’s 
scientific life. We could call them the King-Midas-years since everything he 
touched turned to gold in his hands. The achievements were on a magnificent 
scale and the results so overwhelmingly convincing, in spite of their revolu- 
tionary novelty, that they were immediately codified in the annals of science as 
treasures which will never be abandoned again. We now come to the second 
half of EINSTEIN’s scientific life. He still had thirty years in front of him. 
What will happen to a genius who has transcended so often the usual mea- 
sure of inventiveness granted to a single individual? 

The fires were still burning and the fervor and devotion with which he 
probed into the mysteries of the universe remained unabated until the last 
day of his life. But the great disruption of European science on account of 
the Nazi hordes who swept over country after country, burning and destroying 
the intellectual values of Western civilization, had its effect on EINSTEIN. 
Generously the United States gave him sanctuary and he always gratefully 
acknowledged the splendid work conditions under which he could continue 
his scientific activities. But the loss of his friends and of the incomparable 
mental stimulus which the city of Berlin offered in the two decades between 
1910 and 1930, had something to do with his gradual dropping out of the 
limelight of contemporary physics. The friends and admirers of EINSTEIN 
often regretted the «splendid isolation » which became more and more his 
habit as the years went by. He said himself [43, p. 5]: «I live in that solitude 
which is painful in youth, but delicious in the years of maturity ». 

The most important springs of this isolation, however, lay in himself and 
in the inevitable destiny which drove him forward. The ten years in which 
he worked on general relativity, had a profound influence on his philosophy 
and changed substantially his scientific outlook. EINSTEIN started his scientific 
career as a cool and sceptical positivist, strongly influenced by the teachings 
of the Viennese philosopher MACH [46]. He put physical intuition highly 
above mathematical speculation and was inclined to see in mathematics 
nothing more than a purely formal tool of description which came about in 
MACH’s sense as an accident of historical evolution rather than as a necessary 
part of nature itself. But if one looked at the final draft of the new gravita- 
tional theory, one could not help seeing that it was the result of philosophical 
and mathematical speculations of the highest order and not the outcome of 
a mere « description of observations » to which the positivistic philosophy rele- 
gates the role of theoretical physics. No amount of experimentation could 
ever suggest a set of equations such as the Einsteinian field equations of gra- 
vity. No amount of descriptive activity could ever bring a physicist to the 
strange wonders of Riemann’s curvature tensor. Under the impact of general 
relativity EINSTEIN abandoned the tenets of positivism and reverted to the 
old Greek natural philosophers who believed that nature is reasonable and 
explorable by mere philosophizing. Of course, the new mode of philosophizing 
appeared under much more exacting conditions. Geometry had no longer the 
simple pattern that it had in EUCLID’s time. The most advanced mathematical 
tools had to be employed in order to penetrate into the true core of the uni- 
verse. Thus EINSTEIN abandoned his earlier mistrust of mathematics and 
became a convert of abstract mathematical thinking, sometimes to a dangerous 
degree. But if we may regret that he ostracized himself to strange 
countries to which nobody was willing to follow him, we have to admire the 
intellectual honesty with which he pursued his aims. Einstein-Faust gazed 
at the sign of the Macrocosmos for ten years, trying magic formula after magic 
formula, to conjure up the mighty spirit of creation. And then the magic 
formula was found and the mighty spirit did appear in all its splendor. Can 
we blame him if he could not settle down again to the gray daily chores of 
life? Can we expect that a man who has seen face to face the cosmic appa- 
rition, should help little people in building little homes? 

The challenge was too great to be satisfied with mere patchwork. Nature 
is certainly not composed of watertight compartments. If we were able to 
unriddle the mystery of gravity by inspired speculation, could not the same 
be done with the other forces of nature? Electricity is the most fundamental 
phenomenon of nature and EINSTEIN hoped that the structure of elementary 
particles could perhaps be explained as an equilibrium between the repulsive 
electric and the attractive gravitational forces. But what is electricity? 

The tying up of the curvature tensor with a matter tensor of external 
origin was not to EINSTEIN’s liking. All the great victories of his gravitational 
theory were made possible by the fact that matter is concentrated in very 
small portions of space. Hence it was essentially the equation 

$$R_{ik} = 0$$

which was needed in explaining the gravitational phenomena, without any 
specific «matter tensor». This matter tensor was not more than a feeble 
substitute for that great unknown that lurks in the atomistic structure of 
material particles and that escapes our knowledge completely. If we would 
understand the essence of electricity in the same way as we understand the 
essence of gravity, all difficulties would disappear and the mystery of the 
atom would be explained. Gravity alone is certainly not enough for the construction 
of a reasonable universe because the solution of the equations of 
gravity leads to infinities which cannot be tolerated in a logical description 
of nature [35]. Something corresponding to electricity is demanded to make 
the construction of the elementary particle possible. But what is the geo- 
metrical meaning of electricity? 

The electromagnetic phenomena are described with the help of the Maxwellian 
equations and these equations are based on an anti-symmetric tensor 
of second order $F_{ik} = -F_{ki}$, in contrast to the metrical tensor which is symmetric: 
$g_{ik} = g_{ki}$. Hence EINSTEIN started to search for an anti-symmetric 
tensor of second order in Riemann’s geometry which, however, he could not 
find. Well, then let us abandon Riemann’s geometry in favor of a still more 
general geometry. But here the situation was very different from his earlier 
journey when he abandoned Euclidean geometry in favor of Riemannian 
geometry. In the earlier journey he was guided by his unfailing physical 
intuition. Moreover, the path was laid out mathematically by the magni- 
ficent inner consistency and self-evidency of tensor calculus which was so 
eminently well adapted to the formulation of physical laws. And thus the 
earlier journey was a constant climbing to higher and higher vistas and the 
reaching of higher and higher plateaus. The foundation was so solid that the 
danger of crashes was eliminated, even if one deviated sometimes from the 
straight road. EINSTEIN’s later journeys are of a more jagged nature and 
marred by many reverses. It was now a groping in the dark and his physical 
intuition was of little help if the mathematical intricacies of the problem 
threatened to drown out the basic line of thought. The difficulty was that 
Riemannian geometry was logically the only possible heir to Euclidean geo- 
metry but if Riemannian geometry had to be abandoned, a number of throne- 
pretenders appeared on the scene. 

The first theory which attempted a geometrical unification of electricity 
and gravity was developed by H. WEYL (13). He based his deductions on 
the assumption that the real physical quantities are the ratios of the $g_{ik}$ and 
not the $g_{ik}$ themselves. This leaves a scalar free in the determination of the 
metric which can be represented with the help of a linear differential form 
$\varphi_i\,dx_i$. The $\varphi_i$ are then interpreted as components of the electromagnetic 
vector potential. Weyl’s geometry was only the first step toward a still wider 
group of geometries in which the line-element is altogether abandoned in favor 
of the $\Gamma^{i}_{kl}$-quantities which now become the fundamental elements of a 
non-metrical geometry in which the parallel displacement of vectors and tensors 
is still possible. These so-called «affine geometries » were extensively investigated 
by EDDINGTON, EINSTEIN, and many others (14). In recent years 
SCHRÖDINGER made further advances and brought his researches in relation to 
the last unified field theory of EINSTEIN (15). EINSTEIN could never warm up 
to the idea of abandoning the line-element since he considered the very high 
persistency of spectral frequencies throughout the universe as a direct proof 
for the existence of an infinitesimal yardstick which can be transported without 
change from one point to the other. In the affine geometries there is no reason 
for the existence of such a rigid yardstick. 


(13) Ann. der Phys., 59, 101 (1919). 

EINSTEIN’s own efforts to combine electricity and gravity into one unified 
geometrical scheme can be catalogued under three major headings. 


Distant parallelism. Riemannian geometry does not allow the notion of 
parallelism between distant points which is independent of the path. By 
introducing n infinitesimal and mutually orthogonal unit vectors $h_{a}$ at every 
point of the manifold, we obtain 16 fundamental quantities from which the 
10 $g_{ik}$ can be constructed with the usual Riemann parallelism. We have, 
however, absolute parallelism too and this gives rise to a new curvature 
tensor [26] 


Mida) = Az» 
which is anti-symmetric in i, k. In terms of these quantities a formal inter- 
pretation of the Maxwell equations becomes possible. EINSTEIN lost confidence 
in the theory when the field equations, derived from the simplest Hamil- 
tonian, allowed static solutions which corresponded to the fields of electrically 
charged particles, in contradiction to the Lorentz force which must bring 
such particles into motion [27]. Field equations without a Hamiltonian, on 
the other hand, remained too vague, on account of the many choices which 
are left open to them [29]. 


The five-dimensional world. TH. KALUZA conceived the idea of generalizing 
Einstein’s equations from four to five dimensions and considering the motion 
of a particle, in analogy to Einstein’s postulate in ordinary gravity, as a 
shortest line (geodesic) of this five-dimensional manifold (16). The fifth dimension 
entered in the line-element but the fifth variable remained dormant since 
the $g_{ik}$ were supposed to be independent of the surplus variable. The motion 
of an electrically charged particle in an electromagnetic field (Lorentz-force) 
could thus be explained. But no clues could be drawn concerning the structure 
of elementary particles since the field equations did not allow solutions which 
avoided infinities [22]. VEBLEN and HOFFMANN gave in 1930 a different interpretation 
of the fifth dimension, by changing from a metrical to a projective 
geometry (17). EINSTEIN arrived independently somewhat later at an equivalent 
theory [30, 31], by introducing a flat five-dimensional space in addition 
to the curved four-dimensional space, and defining the relations which shall 
exist between the parallel displacements in these two types of spaces. He 
thus obtained a formal interpretation of the electromagnetic energy tensor. 
In further pursuance of these ideas he once more returned to Kaluza’s original 
theory [33] but replaced the unnatural assumption that the fifth variable 
remains completely inactive in the structure of the world, by a more natural 
hypothesis. 


(14) Cf. Sir E. WHITTAKER: History of the Theories of Aether and Electricity, 1900-1926 
(London, 1953), p. 190, for a condensed bibliography. 

(15) Proc. Roy. Irish Acad., 49, 43 (1943), and subsequent papers. 

(16) Preuss. Akad. Wiss. Berlin, Ber. 966 (1921). 


The non-symmetrical line-element. In the last years of his life EINSTEIN 
returned to the four-dimensional structure of the world but sought to enlarge 
the ordinary point-transformations of general relativity by transformations 
involving complex coefficients. If we do so, the $g_{ik}$ cease to be real functions 
of the variables but become complex numbers, yet in such a way that the 
entire line element remains real (Hermitian forms). The symmetry condition 
$g_{ik} = g_{ki}$ is now replaced by the Hermitian condition [36] 

$$g_{ik} = \overline{g_{ki}}.$$

The real part of the $g_{ik}$ gives a symmetric, the imaginary part an antisymmetric 
tensor. They can be correlated to gravitation and electromagnetism respectively. 
Later EINSTEIN abandoned the complex nature of the line-element and assumed 
a real but non-symmetric $g_{ik}$. If the $g_{ik}$ became symmetric, one returned 
to the original gravitational equations. The anti-symmetric part of the $g_{ik}$, 
however, could be made responsible for the appearance of electricity in nature. 
This theory went through several modifications but the basic ideas remained 
unchanged [37, 40]. They represent the last efforts of EINSTEIN to penetrate 
to the ultimate secrets of the universe. His last paper on the unified field 
theory came out shortly after his death [41]. 
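
The statement that a Hermitian set of coefficients splits into a symmetric real part and an antisymmetric imaginary part can be illustrated on a small example. The sketch below assumes the Hermitian condition in the form that each coefficient equals the complex conjugate of its transposed partner; the 2×2 matrix and all function names are hypothetical and serve only as an illustration.

```python
def is_hermitian(g):
    """True if g[i][k] equals the complex conjugate of g[k][i] for all i, k."""
    n = len(g)
    return all(g[i][k] == g[k][i].conjugate() for i in range(n) for k in range(n))


def split_parts(g):
    """Return the real and imaginary parts of g as two real matrices."""
    real = [[z.real for z in row] for row in g]
    imag = [[z.imag for z in row] for row in g]
    return real, imag


def is_symmetric(m):
    n = len(m)
    return all(m[i][k] == m[k][i] for i in range(n) for k in range(n))


def is_antisymmetric(m):
    n = len(m)
    return all(m[i][k] == -m[k][i] for i in range(n) for k in range(n))


# Hypothetical Hermitian coefficients, chosen only for illustration.
G = [
    [1 + 0j, 2 + 3j],
    [2 - 3j, 5 + 0j],
]

if __name__ == "__main__":
    real, imag = split_parts(G)
    print(is_hermitian(G), is_symmetric(real), is_antisymmetric(imag))
```

The diagonal of the imaginary part vanishes automatically, as it must for an antisymmetric tensor.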

For the final evaluation of EINSTEIN the scientist it makes not the slightest 
difference whether he did or did not find the ultimate equations of the uni- 
verse. He discovered such a galaxy of dazzling ideas and gave mankind a 
program of such nobility that his unique position in the history of human 
thought is ensured for all ages. 


(17) Phys. Rev., 36, 810 (1930). 


5. — We will conclude this sketch of the scientific evolution of ALBERT 
EINSTEIN with a brief summary of his life-data [46, 47]. He was born in Ulm 
(Germany), on March 14, 1879, but his parents soon moved to Munich where 
he received his primary and secondary schooling. He did not like school, nor 
did school like him (his Latin professor told him once: « Einstein, you will 
never amount to anything »). His great liking for the natural sciences 
and for geometry was clearly marked from his earliest youth. When his father 
failed in his engineering business and emigrated to Italy, EINSTEIN went to 
Switzerland in order to finish his secondary education. The Swiss form of 
democracy and its freedom from any military pressure was much more to his 
liking than the nationalistic and militaristic tradition of Germany. In 1901 
he acquired Swiss citizenship. His college years were spent at the Polytechnicum 
of Zurich where he graduated in 1901. In 1902 he became a Swiss civil servant 
by accepting a minor position in the Patent Office of Berne. Here he remained 
up to 1909. His duties were in no way inspiring but left him much time for 
day-dreaming in physics. He got married in 1902 and two sons were born 
to him. However, the marriage was not a success and was dissolved in 1919. 
In 1909 he was already an internationally known physicist of the first rank. 
He received a call to the University of Zurich, where he remained up to 1914, 
with an interlude of one year in 1911-12, spent as full-professor of physics 
at the University of Prague (Austria). In 1914, at the instigation of MAX PLANCK 
and supported by NERNST, WARBURG and RUBENS, EINSTEIN was elected a 
member of the Berlin Academy of Sciences, under extraordinarily favorable 
conditions. He was made Director of the Research Institute for Physics of 
the Kaiser-Wilhelm-Institute in Berlin. Free of any official obligations, free 
even of any lecturing duties, he could devote himself fully to the pursuance 
of his speculations. His years in Berlin (from 1914 to 1933) were extraordinarily 
happy. He attended regularly the physics seminars of the university. 
These seminars saw a galaxy of stars of first magnitude which filled the audience 
with awe: EINSTEIN, PLANCK, LAUE, NERNST, SCHRÖDINGER, RUBENS were 
almost always present. The lively discussions which developed after every 
presentation—reviews of the latest stand of experimental and theoretical 
research in all branches of physics—remain in the indelible memory of everyone 
who attended this incomparable gathering. In 1919 EINSTEIN married his 
cousin Elsa with whom he lived happily until her death in 1936. When the 
great Dutch physicist H. A. LORENTZ retired in 1923 from his chair of theoretical 
physics at the University of Leyden, EINSTEIN was appointed as his successor. 
Since, however, he did not want to lose his Berlin associations, he consented 
to go to Leyden every year for a temporary visit, staying with his intimate 
friend EHRENFEST and giving lectures at the university as a visiting professor. 
EINSTEIN was fortunately abroad when in 1933 the Nazis came to power. 
Recognizing the barbarous character of the new regime, he immediately re- 
signed from his Academy post and renounced his German citizenship. The 
State confiscated his possessions. Already in 1932 the newly founded Princeton 
Institute for Advanced Study had elected him to its faculty. This position 
now became permanent. EINSTEIN emigrated to the United States and became 
a U.S. citizen in 1940. In 1945 he reached retiring age but was requested 
to stay. He, however, did not want that an exception should be made in 
his favor and went into retirement, although continuing with his daily studies. 
His Princeton years were principally devoted to the motion problem of general 
relativity [32, 34] and to an extension of his gravitational theory to the inclu- 
sion of electricity. His efforts culminated in the « unified field theory » which 
went through several formulations. The last formulation, which he considered 
the final one, came out after his death [41]. His health had been deteriorating 
since the end of 1954, but he continued his regular routine, although now working 
mostly at home. 

The shadows were lengthening as the years passed by. The raging intel- 
lectual fire burned out his organism prematurely. EINSTEIN looked always 
about ten years older than he actually was. More than once in his life he was 
dangerously ill but always with a good chance of recovery. During the last 
year, however, the rapid decline of his physical forces became alarmingly 
manifest. When in April of this year he was transferred to the hospital of 
Princeton, he knew that no hope was left. Calmly and unperturbed, with no 
pathos, no sentimentality, no regret, he waited for the approaching death. 
He died with the same simplicity and humility with which he lived. « Homo 
liber de nulla re minus quam de morte cogitat », said the great Dutch philosopher 
SPINOZA whom EINSTEIN held in particularly high esteem. «The free 
man thinks of nothing less than of death ». ALBERT EINSTEIN was a free man! 
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IL NUOVO CIMENTO 


FONDAZIONE FRANCESCO SOMAINI
AT THE TEMPIO VOLTIANO, COMO


ANNOUNCEMENT OF COMPETITIONS
FOR THE 1958 PRIZE AND FELLOWSHIP


With the aim of rewarding and encouraging, in the name of ALESSANDRO VOLTA, the study of physics in Italy, the « Fondazione Francesco Somaini », at the Tempio Voltiano in Como, announces the following competitions.


A) Competition for the « Premio Triennale per la Fisica Francesco Somaini » for 1958, of L. 1500000 (one million five hundred thousand) net, to be awarded to the candidate who, among those whom the Jury judges in absolute terms to be worthy of the prize for the results achieved in the study of physics during the three-year period 1 July 1955 to 30 June 1958, is in the opinion of the Jury itself the most deserving.


B) Competition for the « Borsa Francesco Somaini per lo studio della Fisica » for 1958, of L. 750000 (seven hundred and fifty thousand) net, to be awarded to the candidate who, among those whom the Jury judges in absolute terms to be worthy of the Fellowship, is judged by the Jury itself to be the most deserving, both for qualifications, scientific training, work already done and results already achieved in physics, and for the benefit that the studies for which the Fellowship is requested may bring to the development of physics in Italy.




1. Both Competitions are open, individually, to Italian citizens of either sex and to Swiss citizens of the Canton Ticino provided they are of Italian descent. Members of the Administrative Committee and of the Scientific Committee of the « Fondazione Francesco Somaini » are excluded from the Competition.


2. The detailed rules of the individual Competitions will be published in a special leaflet, which interested persons may request from the Secretariat of the Foundation at the Tempio Voltiano in Como.


3. The application, documents, works, etc., submitted by the individual candidates must reach the Administrative Committee of the « Fondazione Francesco Somaini » in Como, at the Tempio Voltiano, between 1 January and 12 noon on 1 July 1958.


4. The Premio Triennale per la Fisica may also be awarded to a scholar who has not taken part in the Competition but has been put forward by a Member of the Jury, with a reasoned proposal, as deserving particular consideration, or who is considered worthy of the prize by the Jury independently of any nomination.
* * *


The procedure of the above Competitions is governed by the Statute of the Foundation, which may be inspected in Como at the Tempio Voltiano and is deposited in the records of the Notary Dr. Raoul Luzzani of Como.


Como, from the Tempio Voltiano, 6 August 1955.
The Secretary, Curator of the Tempio: CESARE MORLACCHI        The President, Mayor of Como: PAOLO PIADENI


(The full text of these Competitions was published in the January 1956 issue of Il Nuovo Cimento.)


SOCIETÀ ITALIANA DI FISICA


SPECIAL COMPETITION FOR A PRIZE
FOR STUDIES ON SEMICONDUCTOR MATERIALS


On the occasion of the forthcoming inauguration in Forlì of the Centro di Ospitalità for writers, artists and scientists, founded in memory of LIVIO and MARIA GARZANTI by their son, the publisher ALDO GARZANTI, the latter has established a scientific prize, whose nature and purpose he has set out in the following declaration:

« Considering the great importance that semiconductor materials have acquired in recent years, both in the strictly scientific field and in applications, a prize of one million lire is established in favor of an Italian scholar who, by 31 May 1956, has submitted a paper in which, after a brief summary of the physics of these materials, the possibilities of manufacturing and using them in Italy are set out in the most concrete terms, with reference to the raw materials available here and to the costs and conditions of our industry. Account will be taken of the personal scientific and technical contributions that the candidate may have made to the knowledge of the properties and applications of these materials. The prize will be awarded to the best of the papers judged worthy by the appointed Committee. The copyright and the right of eventual publication of the winning paper belong to the publisher Garzanti ».

The Società Italiana di Fisica, at the request of the publisher Garzanti, has gladly agreed, applauding his initiative, to sponsor the Competition and to look after all the operations it involves; accordingly, in agreement with the publisher himself, it has drawn up the following Announcement.


1. A competition is opened for a prize of L. 1000000 (one million), entitled « Premio Livio e Maria Garzanti », for studies on semiconductor materials.


2. Persons of Italian nationality who are not members of the examining Committee may take part in the Competition.


3. Applications for admission to the Competition, written on unstamped paper and giving the particulars and address of the candidate, must reach the Presidency of the Società Italiana di Fisica (Milano, Via Saldini, 50) by 31 May 1956.

The candidate must attach to the application: a) a signed declaration that he is of Italian nationality; b) five (typewritten) copies of an original paper written by him, in which, after a brief summary of the physics of semiconductor materials, the possibilities of manufacturing and using them in Italy are set out in the most concrete terms, with reference to the raw materials available here and to the costs and conditions of our industry; c) those qualifications, documents and publications (the latter in five copies) that can attest the contribution made by the candidate to the knowledge of the physical properties and to the technical and practical applications of semiconductor materials; d) six (typewritten) copies of the complete list of all the papers submitted.


4. The judging Committee of the competition consists of five members, four of whom are appointed by the Presidential Council of the Società Italiana di Fisica and one by the publisher Aldo Garzanti as his representative.


5. The Prize will be awarded to the best of the papers judged worthy by the Committee. The copyright and the right of eventual publication of the winning paper belong to the publisher Aldo Garzanti.




6. The decision of the Committee is final.


7. The winner will be announced, if time permits, at the inauguration of the Centro di Ospitalità in Forlì; otherwise, together with the awarding of the Prize itself, at the International Congress of Physics to be held next September in Turin to commemorate the great Italian physicist and chemist AMEDEO AVOGADRO on the centenary of his death.


The Secretary: G. C. DALLA NOCE        The President: G. POLVANI


SUPPLEMENTO AL VOLUME II, SERIE X        N. 9, 1955
DEL NUOVO CIMENTO        2° Semestre


Carl Friedrich Gauß (*).


M. KRAFFT


Philosophische Fakultät der Philipps-Universität, Marburg an der Lahn - Deutschland


Anyone who undertakes to write about the scientific achievements of CARL FRIEDRICH GAUSS finds himself in a very uncomfortable position and regrets the venture more and more as time goes on. Yet the sources for such a work are more complete and more extensive than is usual in comparable cases. Not only is there the great twelve-volume edition of the Werke [1], which also draws on the Nachlass and whose completion and high quality are owed essentially to the energy of FELIX KLEIN; in addition, volumes X₂ and XI₂ of the Werke consist entirely of detailed appreciations, by distinguished specialists, of GAUSS's achievements in his various fields of work [2]. To these are added the extensive publications of GAUSS's correspondence [3]. A very impressive short summary appreciation of GAUSS's achievements is due to F. KLEIN [4]. The first great difficulty is that GAUSS's work extends over so many different fields that hardly anyone today commands them all equally. The second is the abundance of material, which through the necessary selection gives every account a strongly subjective coloring. The selection is made harder still by the surprising connections between seemingly heterogeneous questions. Most awkward, however, is the third difficulty: GAUSS unfortunately had the habit of not publishing many of his most important investigations. For this he had to accept a good deal of justified reproach from his friends. Thus in 1833 the astronomer SCHUMACHER, alluding to the motto of GAUSS's seal, « Pauca sed matura », wrote to him: « I come back to my old wish that it might please you to write longer and faster. I would then venture to present you at once with a new seal bearing "multa nec immatura" ». All these withheld results were rediscovered by other researchers, partly during GAUSS's lifetime, partly in the decades following his death. In some cases veiled hints from GAUSS played an important part. The living development everywhere starts from these rediscoverers. For an understanding of GAUSS's personality it is indispensable to discuss these unpublished works as well, but one must avoid diminishing the merit of the later researchers.


* * *


CARL FRIEDRICH GAUSS was born on 30 April 1777 in Braunschweig, the son of poor parents. Already in elementary school his talent attracted the attention of his teachers. They therefore wrested from his reluctant father consent to the son's transfer, made possible by scholarships, to the Gymnasium. The teachers at the Gymnasium and at the Carolinum that followed it (the nucleus of today's Technische Hochschule) also took exemplary care of young GAUSS. Above all they recommended him to the reigning Duke KARL WILHELM FERDINAND. The Duke granted GAUSS financial support that was very generous for so poor a country; in the course of time it rose to 600 thalers a year, a considerable sum for the period, and it carried no obligation of any kind. Only with the collapse of the old political order through NAPOLEON's victory over Prussia at Jena and Auerstedt in 1806, and the death of the Duke from the wound he received in the battle, did this paternal care come to an end. From 1795 to 1798 GAUSS attended the University of Göttingen, through whose library he came to know EULER, LAGRANGE and the other great researchers; he then returned to Braunschweig and took his doctorate in 1799 under J. F. PFAFF at Helmstedt, the university of the Duchy of Braunschweig. In 1807, before the founding of the short-lived Napoleonic Kingdom of Westphalia, he was called to Göttingen as professor of astronomy and director of the observatory. He remained in this post until his death on 23 February 1855. The first years in Göttingen were clouded by hardship resulting from war contributions and by the non-payment or late payment of his salary, caused by the extravagance of the court and the uninterrupted wars. To this was added much family misfortune, above all the death of his dearly loved first wife in 1809 and that of his second wife in 1831.


In mathematics, astronomy and the other sciences he cultivated, GAUSS was essentially self-taught, just like the whole older generation of his scientific friends and correspondents, all of whom came from other professions. It suffices to recall the astronomers OLBERS, V. ZACH, LINDENAU, SCHUMACHER and BESSEL and the mathematician PFAFF. This, together with the fact that the philosophical faculties of the 18th century aimed only at elementary instruction, explains GAUSS's great aversion to lecturing to students who were little interested and insufficiently prepared. The truly gifted he always helped through individual encouragement. Perhaps this explains why his personal pupils, of whom only ENCKE, MÖBIUS, V. STAUDT, DEDEKIND, LISTING and MINDING may be named as examples, and among whom one may probably also count, on the strength of his Habilitation lecture, RIEMANN, who came from DIRICHLET, to a considerable extent did not continue their master's work but went their own ways.


Characteristic of GAUSS's scientific work is the striving to eliminate every avoidable uncertainty in science and every subjective distortion of the results, even an unconscious one. This goal of the highest degree of objectivity required turning away from the lax methods of proof of 18th-century mathematics and returning to the « rigor antiquus », the strict proof procedures of the ancient mathematicians. In physics and the other measuring sciences it led inevitably to the rejection of arbitrary selections of measurements within a series and to the principle that all measurements belonging to the series have equal standing. A further requirement was that the principles for treating a series of measurements should be fixed before the reduction. Here too belong the careful investigation of instrumental errors and the development of measuring procedures in which they are rendered as harmless as possible. GAUSS's passionate love of truth not only set up this program but contributed essentially to its realization. GAUSS made the highest demands on his works not only in content but also in form. Unfortunately this meant that in the published form it is usually no longer possible to see how the results were found. This often makes real understanding difficult.


Among GAUSS's earliest mathematical activities, beginning as early as 1792, was his occupation with the parallel axiom. If already at fifteen he was thinking of a geometry « that would have to hold if Euclidean geometry is not the true one », this thought evidently led to no immediate consequences. For the further development the record is very fragmentary; only the letters to his student friend WOLFGANG BOLYAI, who was likewise occupied with the parallel axiom, allow some conclusions. According to these GAUSS, like various predecessors, seems to have developed the geometry that results from rejecting the parallel axiom in the hope of arriving at a contradiction. In the course of a development not free of setbacks, and evidently against strong inner resistance, the conviction that a proof is impossible prevailed in him. How difficult this break with tradition was is perhaps best shown by the fact that so outstanding a mathematician as LEGENDRE never reached it. By about 1816 a provisional conclusion had been reached: « non-Euclidean geometry » had come into being. That of the two conceivable non-Euclidean geometries, which we today call elliptic and hyperbolic, only the hyperbolic one arose at that time, and likewise at first with the rediscoverers, is connected with the fact that elliptic geometry was excluded by retaining the axiom of Eudoxus and Archimedes (« a magnitude can be multiplied so often that it exceeds any other magnitude of the same kind »). Precisely here GAUSS anxiously held back his insights and imposed silence on his correspondents, because hints in a review had brought him attacks that annoyed him. Parts of the new geometry were, admittedly, so unfamiliar to thought that this should cause no surprise; the non-existence of similar figures and the existence of a distinguished length are examples. Later, during his work on the theory of surfaces, GAUSS again occupied himself with non-Euclidean geometry, as the correspondence shows. He broke off his investigations almost completely, however, when non-Euclidean geometry was independently rediscovered twice. Beginning in 1823, JOHANN BOLYAI, the son of GAUSS's friend of his youth and at that time a lieutenant of engineers in Temesvar, developed non-Euclidean geometry anew and published it in 1831. GAUSS reported on it to his pupil GERLING and wrote: « I consider this young geometer v. Bolyai a genius of the first rank. » Also in 1823 began the investigations of the Russian mathematician NIKOLAI IWANOWITSCH LOBATSCHEFSKIJ, professor in Kazan. His first Russian publications remained completely unknown in Western Europe, if they reached it at all. Only his second work written in German, of 1840, came to GAUSS's notice, as the result of a disparaging review. GAUSS, who had learned Russian when over sixty, now also obtained LOBATSCHEFSKIJ's Russian treatises. The appointment of the Russian scholar as a corresponding member of the Gesellschaft der Wissenschaften in Göttingen is a sign of the esteem in which GAUSS held him.

The achievements of BOLYAI and LOBATSCHEFSKIJ became more widely known only through the tireless work and the translations of HOÜEL, who threw into the scale for both of them the mighty authority of GAUSS by translating parts of letters addressed to SCHUMACHER. There is no room here even to hint at the significance of the discovery of non-Euclidean geometry for geometrical research. We therefore confine ourselves to one remark: EUCLID's geometry was in conception the theory of the order of the space of our experience, that is, at bottom a piece of physics. It now turned out that the new geometry, with a suitable choice of the absolute constant occurring in it, achieved the same thing, since the deviations fell below the limit of accuracy of measurement. The philosophical claim that Euclidean geometry has an a priori character thereby became untenable. The further development finally led to the conclusion that the theory of the space of experience, in so far as it does not belong to the domain of psychology, is and will remain unquestionably a part of physics.


Likewise from 1792 on, GAUSS occupied himself with three topics which, seemingly unrelated, grew together in the course of time into a magnificent unity. The first is the so-called « arithmetic-geometric mean »: let two arbitrary positive numbers $a$ and $b$ be given. One forms the two sequences

$$a_1 = \frac{a+b}{2}, \quad a_2 = \frac{a_1+b_1}{2}, \quad \ldots, \quad a_{n+1} = \frac{a_n+b_n}{2}, \quad \ldots$$

$$b_1 = \sqrt{ab}, \quad b_2 = \sqrt{a_1 b_1}, \quad \ldots, \quad b_{n+1} = \sqrt{a_n b_n}, \quad \ldots$$

It is easy to show that these two sequences tend to one and the same limit $M(a, b)$, the so-called arithmetic-geometric mean. By this definition, incidentally, $M(a, b) = M(a_1, b_1) = \ldots$. The second topic is power series of the form $\sum x^{n^2}$ and related series. Among these are the series now called theta null values. The third topic is the investigation of the integral for the arc length of the lemniscate, $\int_0^x dx/\sqrt{1-x^4}$. GAUSS regards this integral as a generalization of the integral for the arcsine. Guided by this analogy, he considers the upper limit of the integral as a function of the value of the integral. This function, which he called the sinus lemniscaticus, is the first elliptic function in today's terminology. Decisive for the further development was that even then GAUSS made consistent use of complex numbers. The well-known geometric representation of these numbers he found only in 1799 and published only in 1831.
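
The two sequences defining the arithmetic-geometric mean can be followed numerically. The sketch below is only an illustration in Python and is not taken from GAUSS; the function names `agm_sequences` and `agm` are hypothetical, and ordinary floating-point arithmetic is assumed.

```python
import math


def agm_sequences(a, b, steps):
    """Return the lists [a, a_1, ..., a_steps] and [b, b_1, ..., b_steps]."""
    a_seq, b_seq = [a], [b]
    for _ in range(steps):
        # Both new terms are formed from the previous pair (a_n, b_n).
        a, b = (a + b) / 2.0, math.sqrt(a * b)
        a_seq.append(a)
        b_seq.append(b)
    return a_seq, b_seq


def agm(a, b, max_steps=60):
    """Approximate the common limit M(a, b) of the two sequences (a, b > 0)."""
    for _ in range(max_steps):
        if a == b:
            break
        a, b = (a + b) / 2.0, math.sqrt(a * b)
    return (a + b) / 2.0


if __name__ == "__main__":
    a_seq, b_seq = agm_sequences(1.0, 2.0, 5)
    for a_n, b_n in zip(a_seq, b_seq):
        print(f"{a_n:.15f}  {b_n:.15f}")
```

After a handful of steps the two columns agree to machine precision, and `agm(a, b)` returns the same value as `agm(a_1, b_1)`, in line with the remark that $M(a, b) = M(a_1, b_1)$.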


His work on the so-called cyclotomic equations led GAUSS to set up also the equation for dividing the circumference of the lemniscate into $n$ equal parts. To his surprise this equation was not, as expected, of degree $n$ but of degree $n^2$. Already in 1797 he finds « why » this is so. This laconic statement can mean nothing other than that he must then have discovered the second, imaginary, period of the sinus lemniscaticus. His brief remark in Article 335 of his « Disquisitiones arithmeticae », that his investigations on the division of the circle can be extended, among other things, from the circular functions to the functions depending on the integral $\int dx/\sqrt{1-x^4}$, was incidentally the only thing GAUSS reported about these investigations. This remark made history, however, for it led ABEL and JACOBI to the rediscovery of the theory of elliptic functions.


Meanwhile the theory of the arithmetic-geometric mean had also advanced. GAUSS had attempted the power series expansion of $M(1+x, 1-x) = M(1, \sqrt{1-x^2})$ but then turned to the expansion of the reciprocal, for which a surprisingly simple law of coefficients emerged. In 1799 he noticed by chance that the decimal fractions for $1/M(1, \sqrt{2})$ and for

$$\frac{\varpi}{\pi} = \frac{2}{\pi} \int_0^1 \frac{dx}{\sqrt{1-x^4}}$$

(this $\varpi$ is the real half-period of the sinus lemniscaticus) agreed to 11 places. This led him to the discovery that

$$\frac{\pi}{2\,M(1, \sqrt{1-k^2})} = \int_0^1 \frac{dx}{\sqrt{(1-x^2)(1-k^2x^2)}}$$

must hold. Evidently prompted by uncertainties about the integration of many-valued functions in the complex domain, GAUSS does not turn to the inversion of the general elliptic integral of the first kind but formally generalizes older results on $\operatorname{sin\,lemn} \varphi$. He had expanded this in a series proceeding by the sines of the odd multiples of $\varphi$ and had also represented it as the quotient of two theta series. In addition there were two representations of $\sqrt{\varpi/\pi}$ by theta null values. GAUSS now prescribes a $\mu$ and sets

$$\varpi = \frac{\pi}{M(1, \sqrt{1+\mu^2})}, \qquad \varpi' = \frac{\pi}{\mu\,M(1, \sqrt{1+1/\mu^2})}.$$

The « Sinus lemniscaticus universalissime acceptus » $\mathrm{Sl}\,\varphi$ is then the quotient of two theta series formed with the arguments $(\pi/\varpi)\varphi$ and $x = \exp[-\pi(\varpi'/\varpi)]$ (this $x$ is nothing other than Jacobi's $q$). It is immediately evident that this $\mathrm{Sl}\,\varphi$ has the two periods $2\varpi$ and $2\varpi' i$. If one sets $\sin u = \mathrm{Sl}\,\varphi$, it follows that $du/d\varphi = \sqrt{1+\mu^2 \sin^2 u}$ and hence that $\mathrm{Sl}\,\varphi$ is the inverse function of

$$\int_0^S \frac{ds}{\sqrt{(1-s^2)(1+\mu^2 s^2)}}.$$

This result is only apparently satisfactory. For if one starts from the function $\mathrm{Sl}\,\varphi$, one can prescribe $\varpi$ and $\varpi'$, or else $x$, arbitrarily, with the single restriction, necessary for the convergence of the series, that the real part of $\varpi'/\varpi$ be positive, i.e. $|x| < 1$. $\varpi$ is obtained from a representation of $\sqrt{\varpi/\pi}$ as a theta null value, and $\mu$ too can be computed by theta null values. One cannot very well start from the integral, since $\mu$ can be any complex number, so that the arithmetic-geometric means by which $\varpi$ and $\varpi'$ are defined cannot be formed, because it is not settled how the square roots required for them are to be understood. But even when that has been clarified, it remains doubtful whether the real part of $\varpi'/\varpi$ turns out positive, as required.
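
The numerical coincidence of 1799 can be reproduced with a few lines of Python. This is a minimal sketch under stated assumptions, not GAUSS's own computation: the lemniscate integral is rewritten with the substitution $x = \sin\theta$ as $\int_0^{\pi/2} d\theta/\sqrt{1+\sin^2\theta}$ and evaluated by Simpson's rule; the function names and the number of subintervals are hypothetical choices.

```python
import math


def agm(a, b, max_steps=60):
    """Arithmetic-geometric mean M(a, b) of two positive numbers."""
    for _ in range(max_steps):
        if a == b:
            break
        a, b = (a + b) / 2.0, math.sqrt(a * b)
    return (a + b) / 2.0


def lemniscate_half_period(n=2000):
    """Approximate varpi = 2 * integral_0^1 dx / sqrt(1 - x^4).

    With x = sin(theta) the integrand becomes 1 / sqrt(1 + sin(theta)^2)
    on [0, pi/2], which is smooth; Simpson's rule with an even number n
    of subintervals is used.
    """
    h = (math.pi / 2.0) / n

    def f(theta):
        return 1.0 / math.sqrt(1.0 + math.sin(theta) ** 2)

    total = f(0.0) + f(math.pi / 2.0)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(i * h)
    return 2.0 * total * h / 3.0


if __name__ == "__main__":
    print(lemniscate_half_period() / math.pi)
    print(1.0 / agm(1.0, math.sqrt(2.0)))
```

The two printed numbers agree to well beyond the 11 places mentioned in the text, within the limits of double-precision arithmetic.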

GAUSS at first leaves free, at each step in computing the sequences $a_i$, $b_i$ that define $M(a, b)$, the choice between the two possible square roots. One then obtains for the initial data $a$, $b$ infinitely many different $M(a, b) = a\,M(1, b/a)$ that are not zero (the vanishing ones are of no interest to us). $M(1, k')$ thereby becomes an infinitely many-valued function of $k'$. The series for the theta null values now make it possible to uniformize $M(1, k')$, i.e. to give both $k'$ and $M(1, k')$ as single-valued functions of a parameter $t$. If a particular $t$ yields a value of $k'$, one must first determine all the other $t$ that yield the same value of $k'$. GAUSS succeeds in answering this question, since certain transformation properties of the theta null values are known to him. These new $t$ arise from the original $t$ by certain linear fractional substitutions, which together form a group. This is at bottom nothing other than a so-called principal congruence subgroup of the well-known modular group. $k' = k'(t)$ is thereby recognized as a modular function.

Incidentally, as notes show, GAUSS also considered the modular function belonging to the full modular group, today called the « absolute invariant ». Nothing more is known about this, but it was precisely these notes that in 1877 prompted DEDEKIND to introduce and investigate this function, which he called « Valenz ». The uniformization of the arithmetic-geometric mean by means of the parameter $t$ is now used to declare those $\varpi$ and $\varpi'$ as « belonging together » which belong to the same $t$. It then follows that $t = \varpi'/\varpi$, though under the assumption that there is a $t$ which yields the given $k'$. In 1805 GAUSS interpreted the modular substitutions geometrically and actually drew the fundamental domain of the group in the $t$-plane. In it lie only $t$ with positive real part. By mapping the boundary of the fundamental domain one can infer its image in the $k'$-plane and conclude that $k'^2$ takes every prescribed value in the fundamental domain of the group, and each exactly once. With that all questions are answered.

The sketch given here of GAUSS's work on the theory of elliptic functions is very incomplete and simplifies the complicated course of development considerably. That we understand GAUSS's Nachlass here we owe above all to the devoted work of L. SCHLESINGER. His pupil GEPPERT then shaped GAUSS's Nachlass on the theory of elliptic functions, with relatively few additions, into a readable whole [5].


In the treatise on the secular perturbations that one planet exerts on another, usually cited under the abbreviated title « Determinatio attractionis annuli elliptici » (Werke III), GAUSS communicated some of his results on elliptic integrals (not on elliptic functions) and their relations to the arithmetic-geometric mean. To determine these perturbations it suffices to replace the planet by its orbital ellipse, over which its mass is distributed in such a way that the mass lying on an arc is to the total mass as the time needed to traverse the arc is to the total period of revolution. The attraction exerted by such a ring according to Newton's law can be expressed by elliptic integrals.


His work on elliptic functions also led GAUSS to occupy himself with linear differential equations of the second order. Of his investigations he published only a first part, in 1813, under the title « Disquisitiones generales circa seriem infinitam

$$1 + \frac{\alpha\beta}{1\cdot\gamma}\,x + \frac{\alpha(\alpha+1)\beta(\beta+1)}{1\cdot 2\cdot\gamma(\gamma+1)}\,x^2 + \frac{\alpha(\alpha+1)(\alpha+2)\beta(\beta+1)(\beta+2)}{1\cdot 2\cdot 3\cdot\gamma(\gamma+1)(\gamma+2)}\,x^3 + \text{etc.} »$$

(Werke III). The sum $F(\alpha, \beta, \gamma, x)$ of this « hypergeometric series » is important if only because a vast number of elementary and higher functions, for example the periods of elliptic integrals as functions of the modulus, can be obtained from it by specializing the parameters $\alpha$, $\beta$, $\gamma$. That GAUSS makes no use of the complex domain here is remarkable, since in the planned continuation he would have had to do so. In his treatise GAUSS is probably the first to use the modern concept of convergence of series consistently. For EULER the starting point was always a function that was expanded in a series. If EULER operates with such series even for arguments for which they diverge, this can really always be replaced, though often rather laboriously, by operating with the functions. With GAUSS, however, the starting point is the series, which through its sum first defines the function. Hence the requirement of convergence. What GAUSS lacks is the principle of analytic continuation. As a way out he therefore chooses to define his functions outside the circle of convergence as solutions of the linear differential equation they satisfy. Space forbids going further into the rich content of the work and of the planned continuation, but it should at least be pointed out that, continuing investigations of EULER, it contains a theory of the gamma function (GAUSS calls it the $\Pi$-function, $\Pi(x) = \Gamma(x+1)$).
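
How special functions arise from the hypergeometric series by specializing the parameters can be checked numerically. The following Python sketch sums the series term by term inside the circle of convergence $|x| < 1$; the function names and the truncation at 200 terms are assumptions made for illustration. It compares $F(1, 1, 2, x)$ with $-\ln(1-x)/x$ and $F(\tfrac12, \tfrac12, 1, k^2)$ with $1/M(1, \sqrt{1-k^2})$, two standard identities that are not spelled out in the text above.

```python
import math


def hypergeometric_series(alpha, beta, gamma, x, terms=200):
    """Partial sum of F(alpha, beta, gamma, x) for |x| < 1."""
    if abs(x) >= 1.0:
        raise ValueError("the series is only summed here for |x| < 1")
    total, term = 0.0, 1.0
    for n in range(terms):
        total += term
        # Ratio of consecutive terms of the hypergeometric series.
        term *= (alpha + n) * (beta + n) / ((1 + n) * (gamma + n)) * x
    return total


def agm(a, b, max_steps=60):
    """Arithmetic-geometric mean M(a, b) of two positive numbers."""
    for _ in range(max_steps):
        if a == b:
            break
        a, b = (a + b) / 2.0, math.sqrt(a * b)
    return (a + b) / 2.0


if __name__ == "__main__":
    x = 0.5
    print(hypergeometric_series(1, 1, 2, x), -math.log(1 - x) / x)
    k2 = 0.25
    print(hypergeometric_series(0.5, 0.5, 1, k2),
          1.0 / agm(1.0, math.sqrt(1 - k2)))
```

The second comparison shows the connection with the period of an elliptic integral as a function of the modulus, which is the example named in the text.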


It emerges from the correspondence that GAUSS knew that the integral of a single-valued function in the complex domain can be a many-valued function of the upper limit. In particular he knows that $\int dx/x$, taken along a closed path winding $n$ times around $x = 0$, equals $2n\pi i$. He also knows that, in modern terms, the integral of a single-valued analytic function in a simply connected region of regularity is independent of the path of integration. GAUSS thus possessed essential results of CAUCHY decades before him.
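
The value $2n\pi i$ can be confirmed by a direct numerical integration. The sketch below is an illustration only: it assumes the path is the unit circle traversed $n$ times and uses a midpoint rule with a hypothetical default number of sample points.

```python
import cmath
import math


def integral_dz_over_z(windings, samples=20000):
    """Midpoint-rule approximation of the integral of dz/z along the
    unit circle traversed `windings` times (negative means clockwise)."""
    total = 0j
    for k in range(samples):
        t0 = 2.0 * math.pi * windings * k / samples
        t1 = 2.0 * math.pi * windings * (k + 1) / samples
        z0 = cmath.exp(1j * t0)
        z1 = cmath.exp(1j * t1)
        z_mid = cmath.exp(1j * (t0 + t1) / 2.0)
        total += (z1 - z0) / z_mid
    return total


if __name__ == "__main__":
    for n in (1, 2, 3, -1):
        print(n, integral_dz_over_z(n), 2j * math.pi * n)
```

The computed values approach $2n\pi i$ as the number of sample points grows, and a path that does not enclose the origin would give zero instead.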


GAUSS's dissertation of 1799, « Demonstratio nova theorematis omnem functionem algebraicam rationalem integram unius variabilis in factores reales primi vel secundi gradus resolvi posse » (Werke III), is devoted to the so-called fundamental theorem of algebra. Let

$$P(z) = a_0 + a_1 z + a_2 z^2 + \ldots + a_{n-1} z^{n-1} + z^n$$

be a polynomial with real coefficients. Today we prove that there are $n$ real or complex (not necessarily distinct) numbers $z_1, z_2, \ldots, z_n$ such that

$$P(z) = (z - z_1)(z - z_2) \cdots (z - z_n).$$

Since with every complex $z_i$ its complex conjugate also occurs, the non-real factors can be combined in pairs into real factors of the second degree. Hence the formulation of the theorem in the title of the dissertation. GAUSS's proof proceeds, as far as this is possible, entirely in the real domain. It is preceded by a detailed critique of the older proofs. As the « Demonstratio nova » indicates, GAUSS regards these older proofs not as false but as in need of completion. If one sets $z = x + iy = r(\cos\varphi + i\sin\varphi)$, then $P(z) = Q(x, y) + iR(x, y)$. The zeros of $P(z)$ are thus found as intersections of the curve $Q(x, y) = 0$ with the curve $R(x, y) = 0$. Since one easily sees that for very large $r$ the curve $Q = 0$ approximately coincides with the rays through $x = y = 0$ for which $\cos n\varphi = 0$, and the curve $R = 0$ with the rays for which $\sin n\varphi = 0$, and since neither curve can end freely in the finite plane, one easily concludes geometrically that intersection points exist. GAUSS later gave three further proofs of the same theorem (Werke III). The second proof is a sharpening of the attempted proof of EULER and FONCENEX discussed in the dissertation; the fourth gives a recasting of the proof of the dissertation. The third proof is particularly interesting. It is at bottom an application of the residue theorem to the logarithmic derivative of $P(z)$. This is veiled by the fact that the contour integral is transformed into a surface integral by means of Gauss's integral theorem from potential theory.


The fame of GAUSS was founded by his investigations in number theory, begun in 1791, which are mainly contained in the «Disquisitiones arithmeticae» (Werke I), published in 1801. This work comprises three clearly separate parts. The first, sections 1 to 4, rests on the representation of integers as products of primes and thus belongs to «multiplicative» number theory; the second, sections 5 and 6, treats quadratic forms and belongs to additive number theory. Later development turned this into the number theory of quadratic number fields, i.e. it incorporated the subject into multiplicative number theory. The third part, section 7, gives the application of number theory to the cyclotomic equation. As already said above, GAUSS knew that these investigations could be extended further, e.g. to the equations for the division of the lemniscate. The first part was essentially finished when GAUSS came to Göttingen in 1795. There he had to discover that much of what he had found independently was already known to FERMAT, EULER, LAGRANGE and LEGENDRE, yet enough remained that was his alone. The second part goes back essentially to stimuli from the works of LAGRANGE.

An essential tool for GAUSS is the concept, introduced by him, of the congruence of two integers $a$ and $b$ with respect to a nonzero integer «modulus» $m$: $a \equiv b \bmod m$ (read «$a$ congruent to $b$ modulo $m$») means that $a - b$ is divisible by $m$ without remainder. The congruence has, and this is essential, the characteristic properties of an equality relation. One thus arrives at an analogy between congruences and equations. For example, alongside the equation $x^2 = D$ there appears the problem of finding, for a given integer $D$ not divisible by $m$, the integers $x$ for which $x^2 \equiv D \bmod m$. From the content of the first part of the Disquisitiones arithmeticae we pick out two particularly important results. If $p$ is a prime and $a$ is not a multiple of $p$, then there is a smallest positive exponent $t$ for which $a^t \equiv 1 \bmod p$. One then says: «$a$ belongs to the exponent $t$». This $t$ is either equal to $p-1$ or a divisor of $p-1$. A first consequence is «Fermat's little theorem»: «Always $a^{p-1} \equiv 1 \bmod p$». To every divisor $t$ of $p-1$, in particular also to the exponent $p-1$, there really do belong numbers $a$. Those belonging to $p-1$ are called «primitive roots mod $p$». If $g$ is one of them, then every number not divisible by $p$ is congruent mod $p$ to one of the powers $1 = g^0, g^1, g^2, \dots, g^{p-2}$. The importance of this result is shown by its use in the third part of the Disquisitiones.
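The notions of «exponent» and «primitive root» described above can be tried out numerically. The following is a minimal sketch in Python; the function names `order_mod` and `primitive_roots` and the choice $p = 7$ are illustrative assumptions and do not come from the text.

```python
def order_mod(a, p):
    """Smallest positive t with a**t ≡ 1 (mod p); assumes p prime and p not dividing a."""
    if a % p == 0:
        raise ValueError("a must not be a multiple of p")
    t, power = 1, a % p
    while power != 1:
        power = (power * a) % p
        t += 1
    return t


def primitive_roots(p):
    """All a in 1..p-1 that belong to the exponent p-1."""
    return [a for a in range(1, p) if order_mod(a, p) == p - 1]


p = 7  # illustrative prime
orders = {a: order_mod(a, p) for a in range(1, p)}
roots = primitive_roots(p)
g = roots[0]
# The powers g^0, ..., g^(p-2) run through every residue not divisible by p.
powers = sorted(pow(g, k, p) for k in range(p - 1))
print(orders, roots, powers)
```

Every exponent found this way divides $p - 1$, and the powers of a primitive root exhaust the nonzero residues, as the text states.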


The question of the solvability of the congruence $x^2 \equiv D \bmod m$ can be reduced to the case in which $m$ is an odd prime $p$ and $D$ is not divisible by $p$. If in this case the congruence is solvable, $D$ is called a quadratic residue mod $p$ and, following LEGENDRE (GAUSS does not use this symbolism), one sets $(D/p) = +1$; if it is unsolvable, $D$ is a quadratic non-residue mod $p$, and one sets $(D/p) = -1$. One can now pose a twofold question: Which $D$ are quadratic residues for a given $p$, and which are non-residues? For which $p$ is a given $D$ a quadratic residue, and for which a non-residue? For $D = -1$ and $D = 2$ the second question had already been answered by EULER. EULER, LEGENDRE and GAUSS, the last by numerical induction, had independently of one another discovered the theorem which in LEGENDRE's formulation reads: «If $p$ and $q$ are distinct odd primes, then always

$$\left(\frac{p}{q}\right)\left(\frac{q}{p}\right) = (-1)^{\frac{p-1}{2}\cdot\frac{q-1}{2}}.»$$

GAUSS calls this theorem the fundamental theorem (theorema fundamentale) of the theory of quadratic residues; today, after the formulation given above by means of the Legendre symbol, it is mostly called the quadratic reciprocity law. Stated in full it says: if at least one of the primes $p$ and $q$ is of the form $4n+1$, then the congruences

$$x^2 \equiv q \bmod p \quad\text{and}\quad y^2 \equiv p \bmod q$$

are both solvable or both unsolvable; if on the other hand both primes are of the form $4n+3$, then one of the congruences is solvable and the other unsolvable. LEGENDRE had attempted a proof of this theorem but had to leave a gap, of which he was himself aware, and which only DIRICHLET was able to fill. GAUSS, without knowing of LEGENDRE, delivered in 1796 the first, still rather laborious proof, which was published in the Disquisitiones. In the second part of the Disquisitiones he gave a second proof, which was followed by six more. The reciprocity law has shown its outstanding importance again and again, whether as a tool of proof, or because each new proof of this theorem led to new questions, or through the attempts proceeding from it to find analogous statements, for instance for cubic or biquadratic residues.
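The reciprocity law can be checked numerically for small primes. The sketch below is in Python and evaluates the Legendre symbol by Euler's criterion, $(D/p) \equiv D^{(p-1)/2} \bmod p$; this criterion is a standard fact brought in here for convenience and is not the method described in the text. The function names and the list of primes are illustrative assumptions.

```python
def legendre(D, p):
    """Legendre symbol (D/p) for an odd prime p not dividing D, via Euler's criterion."""
    r = pow(D % p, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def reciprocity_holds(p, q):
    """Check (p/q)(q/p) == (-1)^(((p-1)/2)*((q-1)/2)) for distinct odd primes p, q."""
    sign = (-1) ** (((p - 1) // 2) * ((q - 1) // 2))
    return legendre(p, q) * legendre(q, p) == sign


odd_primes = [3, 5, 7, 11, 13, 17, 19, 23, 29, 31]  # illustrative sample
all_ok = all(
    reciprocity_holds(p, q) for p in odd_primes for q in odd_primes if p != q
)
print(all_ok)
```

For the pair 3 and 7, both of the form $4n+3$, exactly one of the two congruences is solvable, in agreement with the full statement of the theorem.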


The second part of the Disquisitiones deals with the representation of integers $n$ by so-called (binary) quadratic forms, i.e. with representations of the shape $n = ax^2 + 2bxy + cy^2$. Here $a$, $b$, $c$ are fixed given integers, and $x$ and $y$ are to be integers as well. If $x$ and $y$ are coprime, the representation of $n$ is called a proper representation. One may restrict oneself to such proper representations. If the form $ax^2 + 2bxy + cy^2$ is carried by a linear substitution $x = \alpha x' + \beta y'$, $y = \gamma x' + \delta y'$ with integer coefficients $\alpha, \beta, \gamma, \delta$ into the form $a'x'^2 + 2b'x'y' + c'y'^2$, and if the latter can be transformed back into the original form by a substitution of the same kind, the two forms are called equivalent. Equivalent forms evidently represent the same numbers $n$. For the linear substitutions that carry equivalent forms into one another, necessarily $\alpha\delta - \beta\gamma = \pm 1$. If $+1$ stands on the right, the forms are called properly equivalent (one may restrict oneself to this case), otherwise improperly equivalent. The substitutions with $\alpha\delta - \beta\gamma = +1$ are, however, nothing other than the modular substitutions that occur in the theory of elliptic functions. It turns out that for equivalent forms $b^2 - ac = b'^2 - a'c'$. The equality of these numbers, which GAUSS calls the determinant $D$ of the form (today it is mostly called the discriminant), is thus necessary for equivalence. Examples show easily that not all quadratic forms of the same determinant $D$ need be equivalent and that not every number is representable by such a form. For a number $n$ to be representable by a form with determinant $D$, the solvability of the congruence $z^2 \equiv D \bmod n$ proves to be necessary. This establishes the connection with the reciprocity law. Among all forms with determinant $D$, GAUSS singles out certain ones whose coefficients are restricted by inequalities. He calls these forms reduced. To each $D$ there belong only finitely many different reduced forms, and every form with determinant $D$ is equivalent to one of these reduced forms (where it is possible that reduced forms are also equivalent among themselves). If one collects all forms equivalent to a particular form of determinant $D$ into a class, there are accordingly only finitely many classes for a given $D$. Since the considerations leading to the classification of quadratic forms are not purely formal but rest essentially on intrinsic properties of the forms, one may expect them to lead to number-theoretic theorems as well. Indeed, theorems stated by FERMAT and proved by EULER and LAGRANGE on the representation of certain primes by special quadratic forms now follow, to which GAUSS adds further ones of the same kind. Above all, however, there follow the further developments of the second section of the Disquisitiones, whose presentation probably eludes an elementary exposition.
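The invariance of the determinant $b^2 - ac$ under a substitution with $\alpha\delta - \beta\gamma = 1$ can be verified on an example. The following Python sketch is illustrative only; the function names and the sample form and substitution are assumptions, not taken from the text.

```python
def substitute(form, sub):
    """Apply x = al*x' + be*y', y = ga*x' + de*y' to the form a*x^2 + 2*b*x*y + c*y^2."""
    a, b, c = form
    al, be, ga, de = sub
    a2 = a * al * al + 2 * b * al * ga + c * ga * ga
    b2 = a * al * be + b * (al * de + be * ga) + c * ga * de
    c2 = a * be * be + 2 * b * be * de + c * de * de
    return (a2, b2, c2)


def determinant(form):
    """The determinant D = b^2 - a*c in the sense used by Gauss."""
    a, b, c = form
    return b * b - a * c


def value(form, x, y):
    a, b, c = form
    return a * x * x + 2 * b * x * y + c * y * y


form = (1, 0, 1)      # x^2 + y^2, illustrative
sub = (2, 1, 1, 1)    # alpha, beta, gamma, delta with alpha*delta - beta*gamma = 1
new_form = substitute(form, sub)
print(new_form, determinant(form), determinant(new_form))
```

The transformed form takes at $(x', y')$ the same value that the original form takes at the substituted $(x, y)$, which is why equivalent forms represent the same numbers.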


Nowhere in the «Disquisitiones arithmeticae» do geometric considerations occur; nowhere are geometric relations even hinted at. Yet the very term «congruent» suggests that such relations must exist, and this becomes a certainty through the appearance of the modular substitutions in the transformation of a quadratic form into equivalent ones. In the theory of elliptic functions these substitutions serve to pass from one period lattice to an equivalent one. To set out the connection we restrict ourselves to the special case of a positive definite quadratic form $ax^2 + 2bxy + cy^2$, i.e. a form which, except for $x = y = 0$, takes only positive values. Then $a > 0$, $c > 0$ and $D < 0$. To each such form one assigns, in a one-to-one manner, a parallelogram lattice as follows: in an $XY$ coordinate system whose axes enclose the angle $\varphi$ determined by $\cos\varphi = b/\sqrt{ac}$, one draws through the points $X = x\sqrt{a}$ ($x$ an integer) of the $X$-axis the parallels to the $Y$-axis, and likewise through the points $Y = y\sqrt{c}$ ($y$ an integer) of the $Y$-axis the parallels to the $X$-axis. The vertices $(x\sqrt{a}, y\sqrt{c})$ of this parallelogram lattice we call lattice points. If we consider only these, there are infinitely many parallelogram lattices belonging to the same lattice points, and all of them correspond to the forms equivalent to the initial form. The meshes of such a net always have the area $\sqrt{-D}$. The parallelogram lattices thus correspond to the individual forms, while the system of lattice points corresponds to the class formed by these forms. With the (ternary) quadratic forms in three variables, which GAUSS also draws upon, one arrives in the same way at point lattices in space. GAUSS stated this connection only incidentally, in reviewing a work by one of his students, and in doing so he also emphasised the connection with crystallography (Werke II, p. 188). The congruences, as one now sees at once, give the one-dimensional point lattices.


In 1796 the first publication of GAUSS appeared, his announcement that the regular 17-gon can be constructed with compass and straightedge alone (Werke X₁). This was the first advance in the theory of regular polygons since the Greek mathematicians. The third part of the Disquisitiones probably sprang from this discovery.


To the division of the circle into $n$ equal parts there corresponds algebraically the solution of the equation $x^n - 1 = 0$. Since the linear factor $x - 1$ can be split off from this equation, one need only consider the equation (which we shall henceforth call the cyclotomic equation)

$$x^{n-1} + x^{n-2} + \dots + x + 1 = 0.$$

We restrict ourselves to the case that $n$ is a prime. The solutions of this equation are the integer powers of $e^{2\pi i/n}$, but this reduces the computation of the solutions to transcendental tools (infinite series). That obscures the essential property of the solutions, namely that they are algebraic numbers, and so prevents any deeper insight into their arithmetic character. GAUSS therefore gives a systematic algebraic theory for solving the cyclotomic equation. First he shows that the equation is irreducible, i.e. that the left-hand side cannot be written as a product of two polynomials with rational coefficients. Next, let $r$ be a solution of the cyclotomic equation. Then $r^n = 1$. If one takes $n-1$ integers $\kappa$ which in some order are congruent modulo $n$ to the numbers $1, 2, \dots, n-1$, then the $r^\kappa$ are all the solutions of the cyclotomic equation. For the numbers $\kappa$ GAUSS now takes the powers $g^0 = 1, g, g^2, \dots, g^{n-2}$ of a primitive root $g$ mod $n$. It is important that everything that follows is independent of the choice of $g$, i.e. that one obtains the same result when starting from different primitive roots mod $n$. As an even number, $n-1$ is certainly composite. Let $n - 1 = e \cdot f$, where neither factor equals 1. Then GAUSS arranges the powers of $r$ into $e$ systems of $f$ solutions each, in such a way that in each system the exponents of $g$ (the powers of $g$ are now our $\kappa$) form an arithmetic progression with difference $e$. Let $\eta_0, \eta_1, \dots, \eta_{e-1}$ be the sums of the powers of $r$ in each of these subsystems. Then the $\eta$ satisfy an equation of degree $e$ with rational integer coefficients. If $f$ is not prime but composite, $f = e' \cdot f'$, we consider the powers of $r$ that occur in, say, $\eta_0$. We arrange these likewise into $e'$ subsystems of $f'$ such solutions each, in which the exponents of $g$ form arithmetic progressions with difference $e \cdot e'$. If we then form the sums of the powers of $r$ in each system, these sums satisfy an equation of degree $e'$ whose coefficients depend linearly on the $\eta$. This procedure is repeated as long as possible. If $n-1$ has the prime factorisation $n - 1 = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$, then in this way the solution of the cyclotomic equation is reduced to the solution of $\alpha_1$ equations of degree $p_1$, $\alpha_2$ equations of degree $p_2$, ..., $\alpha_k$ equations of degree $p_k$. The construction of a regular polygon with compass and straightedge alone succeeds if and only if it corresponds algebraically to a solution of the cyclotomic equation by a chain of quadratic equations, i.e. if $n-1$ is a power of 2. It is easy to show that, since $n$ is to be a prime, the exponent of 2 must itself be a power of two.
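The first step of this reduction can be illustrated numerically for $n = 17$ with $e = 2$, $f = 8$. The Python sketch below forms the two sums $\eta_0$, $\eta_1$ from a primitive root and checks, in floating point, that they are the roots of a quadratic with integer coefficients. The function name `gaussian_periods` and the choice $g = 3$ are illustrative assumptions; the computation uses $e^{2\pi i/n}$ directly, which is exactly the transcendental shortcut the algebraic theory avoids, so it serves only as a check.

```python
import cmath


def gaussian_periods(n, g, e):
    """Sums eta_0..eta_{e-1} of powers r^(g^(i + e*k)), k = 0..f-1, with r = exp(2*pi*i/n).

    Assumes n is prime, g is a primitive root mod n, and e divides n - 1.
    """
    f = (n - 1) // e
    r = cmath.exp(2j * cmath.pi / n)
    periods = []
    for i in range(e):
        # Exponents of g in system i form an arithmetic progression with difference e.
        exponents = [pow(g, i + e * k, n) for k in range(f)]
        periods.append(sum(r ** x for x in exponents))
    return periods


eta0, eta1 = gaussian_periods(17, 3, 2)
period_sum = eta0 + eta1
period_product = eta0 * eta1
print(period_sum, period_product)
```

Up to rounding, the sum is $-1$ and the product is $-4$, so $\eta_0$ and $\eta_1$ satisfy $\eta^2 + \eta - 4 = 0$, an equation of degree $e = 2$ with integer coefficients.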


Of the later number-theoretic works of GAUSS we mention only that in his investigations on biquadratic residues (Werke II) he steps outside the field of rational numbers and introduces what is today called the Gaussian number field. It consists of all complex numbers $a + bi$ with rational $a$ and $b$. The integers in this field are the numbers with rational integers $a$ and $b$. With this the original restriction of number theory to the field of rational numbers was removed and the way cleared for further development.


In the unpublished papers of GAUSS there are also a number of theorems on so-called asymptotic laws in number theory. We mention only one example: if $\pi(x)$ is the number of primes not exceeding $x$, then GAUSS states that

$$\lim_{x \to \infty} \frac{\pi(x)}{\displaystyle\int_2^x \frac{dt}{\log t}} = 1.$$

The proof of this «prime number theorem» was achieved only in 1896 by HADAMARD and DE LA VALLÉE POUSSIN. GAUSS himself probably possessed no conclusive proof of this and other theorems of this kind.
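The asymptotic statement can be made plausible by a small computation. The Python sketch below counts primes with a sieve of Eratosthenes and approximates the integral by the composite Simpson rule; both techniques, the function names, and the bound $x = 100000$ are illustrative assumptions and are not taken from the text.

```python
import math


def prime_count(x):
    """Number of primes not exceeding x, by the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (x + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(x) + 1):
        if sieve[i]:
            # Strike out the multiples of i starting at i*i.
            sieve[i * i :: i] = bytearray(len(range(i * i, x + 1, i)))
    return sum(sieve)


def log_integral_from_2(x, steps=100000):
    """Composite Simpson approximation of the integral of dt/log t from 2 to x (steps even)."""
    h = (x - 2) / steps
    total = 1 / math.log(2) + 1 / math.log(x)
    for k in range(1, steps):
        t = 2 + k * h
        total += (4 if k % 2 else 2) / math.log(t)
    return total * h / 3


x = 100000  # illustrative bound
ratio = prime_count(x) / log_integral_from_2(x)
print(prime_count(x), ratio)
```

For this bound the ratio is already close to 1, which illustrates the statement but of course proves nothing about the limit.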


In the night of 1 January 1801 (by astronomical dating) G. PIAZZI in Palermo observed a small star with a considerable proper motion. He was able to follow it with measurements for six weeks. He was in doubt whether it was a comet or a new planet. The news of his discovery reached Paris only after months and met with little attention there. It reached the German astronomers much later still. Since the object was an inconspicuous small star, one could hope to find it again at the next opposition only if its position was known at least approximately. To give such a position, the orbit of the new planet had to be computed in advance. Now the task of determining the orbit of a new member of our planetary system did not arise here for the first time. In 1781 HERSCHEL had discovered Uranus. Since this was evidently a major planet, and since for such planets one was accustomed to an orbital plane differing little from the plane of the ecliptic and to a small orbital eccentricity, a circular orbit in the plane of the ecliptic was first computed for Uranus. This approximation was so good that with it a large number of older observations, spread over a century, could be found and used to improve the orbit. For the numerous non-periodic comets it was customary to compute parabolic orbits, i.e. to set the eccentricity equal to 1. A procedure for determining a parabolic orbit had already been given by I. NEWTON («Philosophiae naturalis principia mathematica, Liber III Propositio XLI Problema XXI: Cometae in Parabola moti Trajectoriam ex datis tribus Observationibus determinare»). OLBERS, relying on the works of LAMBERT and others, had given a new and better procedure, which is essentially still in use today. HALLEY had computed parabolic orbits for the comets of 1682, 1607, 1531 and 1456 whose elements, i.e. the six numbers that fix the orbit, agreed so strikingly that he concluded it must be one and the same comet. With this not only was the first periodic comet discovered, but its period of revolution and hence the semi-axis of its orbital ellipse were also known. That is, he knew a very important element of the orbit. Since he could further conclude that the parabolas he had computed approximated the orbital ellipse well near perihelion, he also had usable approximations for the remaining elements.

In the case of the planet discovered by PIAZZI and named Ceres, no usable approximation was known for any of the six elements. Here too the computation of a circular orbit and of a parabolic orbit was attempted, and even, under an arbitrary assumption about the position of the perihelion, that of an elliptical orbit. But all these attempts, as it turned out afterwards, had led to no usable result. At this stage GAUSS intervened. He set himself the task of computing an orbit from the observations without any arbitrary assumption about the six elements. For this, as a rule, three observations must suffice, each of which gives the declination and right ascension of the planet at the time of observation. The equations connecting the observations with the elements are rather involved and can only be solved by approximations. Here a skilful arrangement and conduct of the computation is essential. Moreover, in general more than three observations are available. Even if one had a reasonable rule for selecting three among them, the question would still remain how to use the surplus observations to improve the orbit determined from the selected ones. GAUSS therefore takes a different path, whose guiding idea is the equal standing of all observations. For this he uses the adjustment calculus he had developed, today mostly called the method of least squares. More will be said about it later. With its help he computes three «normal places» from all the observations. He takes these as the basis of the orbit determination. From the way these normal places are formed it follows that in them the unavoidable observational errors are well compensated. This was important if only because in the case of Ceres the observations were spread over a very short piece of the orbit. Thus GAUSS succeeded in solving the task he had set himself. On the basis of the elements he had determined, he computed an ephemeris of Ceres, with which the rediscovery then succeeded at once. This success, comprehensible even to the non-specialist, made GAUSS widely known and famous.

GAUSS spent the next years improving his method of orbit determination. It had meanwhile gained heightened interest because three further minor planets, Pallas, Juno and Vesta, had been discovered in quick succession. The comprehensive publication appeared only in 1809 as «Theoria motus corporum coelestium in sectionibus conicis solem ambientium» (Werke VII).

Ceres was not found again exactly at the predicted position. This was due primarily to the neglect of the perturbations of its orbit by the major planets. For the purpose in hand, the rediscovery of Ceres, it was not necessary to determine these perturbations. Moreover, given the shortness of the time available and the state of the theory at that time, this would hardly have been possible. It was, however, only consistent that GAUSS, following his success, occupied himself with the perturbations of the orbit of Ceres. He did not complete this work, however, but turned, probably prompted by a prize competition of the Paris Académie des Sciences, to the determination of the perturbations of Pallas. The methods he used here were novel. For reasons unknown to us, however, he broke off these computations and investigations too, shortly before their completion.


When he took over the observatory in Göttingen, GAUSS found hardly any usable instruments. Vigorous activity began right after the first new acquisitions. At first the task was to determine the coordinates of the observatory, and then to determine important astronomical constants. Oddities had to be cleared up, such as the fact that measurements of the obliquity of the ecliptic not only gave substantially different values at the summer solstice than at the winter solstice, but that the measurements of different astronomers also showed no agreement here.

The only exception were the measurements of BESSEL and the measurements of BRADLEY that BESSEL had reduced. This led to a thorough investigation of the possible sources of error, in close collaboration with BESSEL. These investigations, devoted to increasing the accuracy of astronomical measurements, continued over many years, but since GAUSS was heavily burdened by other work the lead passed more and more to BESSEL. It is fascinating, however, to see how these investigations stand in the closest connection with numerous others, in which almost the whole compass of observational astronomy was traversed, from investigations on refraction to those on the proper motion of the solar system. Of the purely astronomical results we mention only the beautiful method for the simultaneous determination of latitude and clock correction by recording the clock times at which three stars reach the same zenith distance. The construction of the new observatory promised to GAUSS at his appointment progressed only slowly, since the Napoleonic wars and the mismanagement of the court had excessively exhausted the financial strength of the state. Even after the Congress of Vienna the shortage of money lasted for many more years. Eventually, however, an institute came into being that was well equipped for its time. The new observatory did not, it is true, deliver long series of measurements such as astronomers love and also need. In return, however, numerous well-known astronomers received their training here.


In 1816 SCHUMACHER, who on behalf of the Danish government was working on an arc measurement from the northern tip of Jutland to the Elbe, proposed to GAUSS that he continue this measurement to the south. GAUSS gladly agreed. The criticism that GAUSS levelled at the French arc measurement of 1792, if only because for the flattening of the Earth it used not the value following from the measurements but one going back to the arc measurement in Peru, may have played a part in this. Closely connected with it is the insight that the figure of the Earth, which GAUSS was the first to define as a level surface of gravity, cannot be an ellipsoid. When he accepted SCHUMACHER's proposal, GAUSS already had more far-reaching plans. He wanted to try to connect his arc measurement with the survey work in Bavaria and Austria, so as to obtain a meridian arc reaching from the far north of Jutland to northern Italy. This goal was not reached, despite the addition of the triangulation of Kurhessen by GERLING, a student of GAUSS. Nor could it be remedied that SCHUMACHER's baseline measurement was never properly reduced. GAUSS placed his triangulation points mainly at the same sites that the French survey officers had chosen in the French military survey of Hanover. This was probably because at the beginning of the measurement he himself had as yet no experience in these matters and therefore wanted to profit from the greater practical experience of his predecessors. Many reproaches directed against the choice of the triangulation points and other circumstances are certainly groundless, since they presuppose conditions that did not exist at the time and take no account of the scarcity of the funds with which GAUSS had to make do. Thus in the end the whole undertaking did not lead to the hoped-for result. Nevertheless it is of unique importance in the history of surveying through the many investigations and inventions connected with it.


First to be mentioned is the invention of the heliotrope, by which a station to be sighted is marked by the image of the Sun reflected from a mirror at that station towards the observer. This made it possible for the first time to measure angles in triangles with very long sides with the necessary reliability. The original construction by GAUSS was a masterpiece of inventiveness. It was soon seen that it could be replaced by substantially simpler devices. Besides its direct use in surveying, GAUSS also used the heliotrope to transmit agreed messages between the vertices of the triangles by flashes of light. He also pointed emphatically to this possibility of optical telegraphy, which spared him great losses of time given the wretched postal and transport conditions of his day. It may be recalled that this kind of telegraphy played a significant role in colonial wars shortly after 1900.


GAUSS not only bore the main burden of the measurements, but also had to carry out himself all the computations connected with them. His intimate personal knowledge of every detail of the measurements and of the conduct of the computations enabled him to give exemplary instructions for these measurements and computations, which have made him the true father of modern geodesy. Through him the adjustment calculus became the principal tool of the computing geodesist. From personal experience also stem the recognition of the insufficient accuracy of angle measurement by repetition and the recognition of how frequent deflections of the vertical are (i.e. the non-coincidence of the normal to the ellipsoid that underlies the survey as reference surface with the normal to the level surface of gravity). The ideas GAUSS formed about the origin of the deflections of the vertical were entirely correct.


Questions of geodesy further led GAUSS to his investigations on conformal mapping and differential geometry. The adjustment calculus, used by GAUSS since 1795, was also discovered independently by LEGENDRE, who called it the «method of least squares». The first exposition by GAUSS is contained in the Theoria motus; owing to delays in the printing of this work, however, priority of publication belongs to LEGENDRE. The adjustment calculus initially had the purpose of countering the habit, very widespread in the 18th century, of falsifying the result of a series of measurements by discarding part of the measurements on the basis of some preconception and retaining only those that yielded the desired result. GAUSS takes the standpoint that in principle all measurements of a series have equal standing and that the principles according to which a series of measurements is to be treated must be fixed from the outset. It is thus a kind of legislation. Plausible practical considerations lead easily to the setting up of rules, including rules that serve to assess the accuracy of the result. LEGENDRE stopped at this standpoint. GAUSS, however, sought to give the procedural rules a deeper foundation by connecting them with the calculus of probability. Since it is in principle impossible to determine the true values of the quantities that are to be ascertained from the measurements, GAUSS sets himself the goal of establishing the most probable values for them. For this, however, he needs the error function $\varphi(\varepsilon)$, i.e. the function with whose help, for arbitrarily given $a$ and $b$ ($a < b$), the probability that the error $\varepsilon$ lies between $a$ and $b$ can be written in the form $\int_a^b \varphi(\varepsilon)\,d\varepsilon$. The determination of this $\varphi(\varepsilon)$ is not possible without an axiom. GAUSS therefore appeals to the generally accepted principle that for direct measurements of a quantity the arithmetic mean of the measurements yields a usable value for it. This then leads to the so-called «Gaussian law of error» $(h/\sqrt{\pi}) \exp[-h^2\varepsilon^2]$. Here $h$ is a constant that can be regarded as a measure of the accuracy of the observations.


Mit dieser Begrindungsart der Ausgleichsrechnung war GAUSS aus ver- 
schiedenen Griinden nicht zufrieden. Vor allem war er sich des hypothetischen 
Charakters seines Fehlergesetzes vollkommen bewußt. He also found that singling out the most
probable value does not do justice to the nature of the problem, since the most probable value
itself has only an « infinitely small » probability. He therefore gave a new foundation in the
three-part memoir « Theoria combinationis observationum erroribus minimis obnoxiae » (Werke IV)
of 1821/1823/1826. The basic idea here is to compare the task of determining any quantities by
measurement with a game of chance in which one can only lose and never win. If it is to be a
genuine game, the player must still have certain choices at his disposal. He will sensibly use
them to keep the risk of his game as low as possible. In our case one is at first still free to
choose the quantities that are to be determined by measurement. Once they are chosen, the values
of the quantities to be measured follow from them. These values will deviate from the results of
measurement. The differences between measured result and computed value are the errors. To each
error $e$ there corresponds a « loss » $V(e)$; the risk is then the sum of all products of $V(e)$
with the frequency of $e$. The most appropriate choice of the quantities to be determined is the
one that makes this « risk » a minimum. They are fixed, however, only once the rule of the game
has been prescribed, i.e. once the function $V(e)$ has been chosen. The simplest choice, and the
most convenient for computation, is $V(e) = e^2$. This approach then reaches the goal without
special assumptions about probabilities, and it does so not only for direct observations but also
for indirect and conditioned ones. GAUSS did not only occupy himself with the general theory; he
also worked out in the finest detail the computational procedure to be applied in each concrete
case. It is still in use today, except where modern computing aids have required modifications.
The applications that GAUSS made lie in astronomy, e.g. in the improvement of a computed planetary
orbit when new observations become available (« Disquisitio de elementis ellipticis Palladis »,
Werke VI) and in the « Bestimmung des Breitenunterschiedes zwischen den Sternwarten von Göttingen
und Altona » (Werke IX), and in geodesy (end of the Theoria combinationis). Because GAUSS's
memoirs were too difficult for practitioners, the calculus of adjustment was brought into general
use only by his pupils ENCKE, HAGEN and GERLING.
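
The rule $V(e) = e^2$ can be illustrated by a small Python sketch for indirect observations: a straight line $y \approx a + b\,x$ is fitted by solving the normal equations. The data values and the function names `fit_line` and `risk` are purely illustrative assumptions and are not taken from GAUSS's memoirs.

```python
def fit_line(xs, ys):
    """Least-squares fit y ~ a + b*x, obtained from the 2x2 normal equations."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    det = n * sxx - sx * sx          # determinant of the normal equations
    a = (sy * sxx - sx * sxy) / det
    b = (n * sxy - sx * sy) / det
    return a, b


def risk(a, b, xs, ys):
    """Sum of the losses V(e) = e^2 over all observations."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Hypothetical measurements (assumed for illustration only).
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.1, 2.9, 5.2, 6.8]
a, b = fit_line(xs, ys)
print(a, b, risk(a, b, xs, ys))
```

Any other choice of `a` and `b` gives a larger value of `risk`, which is the minimum property described in the text.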


The arc measurement in Hanover was followed by a triangulation of the country. To compute the
triangles GAUSS used a conformal (angle-preserving) mapping of the ellipsoid onto the plane. He
could thereby confine himself to calculations in the plane and compute rectangular plane
coordinates for all points. The x-axis of his rectangular coordinate system was the meridian of
the Göttingen observatory; the origin of coordinates lay in Göttingen. GAUSS published nothing
about this method of projection. Only through L. KRÜGER was it rescued from oblivion, developed
further and made the basis of German surveying. Before this direct mapping of the Earth's
spheroid onto the plane, the Prussian Land Survey used for special purposes the Gauss-Schreiber
conformal double projection, which likewise goes back to developments by GAUSS. In it the Earth's
spheroid is first mapped conformally onto the sphere, and the sphere is then mapped conformally
onto the plane. With his conformal projection GAUSS created something essentially new, since
until then the so-called Cassini-Soldner coordinates had been in general use.


GAUSS gave the theoretical foundations in the papers « Allgemeine Lösung der Aufgabe: Die Theile
einer gegebenen Fläche auf einer anderen Fläche so abzubilden, daß die Abbildung dem Abgebildeten
in den kleinsten Theilen ähnlich wird » (Werke IV) of 1822 and « Untersuchungen über Gegenstände
der höheren Geodäsie I/II » (Werke IV) of 1843/1846. The first memoir treats a prize question of
the Copenhagen Society of Sciences, set at the suggestion of SCHUMACHER but going back to GAUSS
himself. In it GAUSS shows: if the square of the line element of an arbitrary real surface is
given in the form $ds^2 = E\,dp^2 + 2F\,dp\,dq + G\,dq^2$, one can always find a transformation
of parameters such that $ds^2$ takes the form $ds^2 = \gamma(\lambda, \mu)\,(d\lambda^2 + d\mu^2)$.
To this end he splits $ds^2$ into two complex conjugate linear differential forms in $dp$ and $dq$
and integrates the differential equation $ds^2 = 0$ by means of an Euler multiplier for each of
the factors. With the new form of the line element, however, the conformal mapping of the given
surface onto the plane is given. This also settles the conformal mapping of surfaces onto one
another. In the later memoir of 1843/1846 GAUSS treats the special case of the conformal mapping
of the ellipsoid onto the sphere.
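
The splitting into two complex conjugate factors can be written out as follows. This is a sketch in modern notation, not GAUSS's own; the symbols $\omega$ (the linear form) and $m$ (the complex integrating factor) are introduced here for illustration, and $E > 0$, $EG - F^2 > 0$ is assumed.

```latex
\begin{aligned}
ds^2 &= E\,dp^2 + 2F\,dp\,dq + G\,dq^2 = \frac{1}{E}\,\omega\,\bar\omega ,
\qquad
\omega = E\,dp + \bigl(F + i\sqrt{EG - F^2}\,\bigr)\,dq , \\[4pt]
m\,\omega &= d\lambda + i\,d\mu
\quad\Longrightarrow\quad
ds^2 = \frac{1}{E\,|m|^2}\,\bigl(d\lambda^2 + d\mu^2\bigr) .
\end{aligned}
```

Multiplying out $\omega\bar\omega = (E\,dp + F\,dq)^2 + (EG - F^2)\,dq^2$ confirms the first line. The factor $1/(E\,|m|^2)$ is the positive function standing in front of $d\lambda^2 + d\mu^2$ in the new form of the line element.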


GAUSS's investigations in differential geometry are also most closely connected with his geodetic
work. If one does not take in the space outside the surface as well, then on the surface one can
measure only the sizes of angles and the lengths of curves. These, however, do not change when
the surface is bent without stretching. In particular, geodesic (i.e. shortest) lines always
remain shortest. Whereas differential geometry before GAUSS had investigated only those
properties of surfaces that are preserved under motions of the surface without change of shape,
for GAUSS, given his starting point, the properties invariant under bending had to join them as
something essentially new.


The concluding publication, « Disquisitiones generales circa superficies curvas » (Werke IV) of
1828, which sums up more than a decade of investigation, puts the systematic exposition first and
thus departs quite considerably from the historical development that can be traced in the drafts.
Following EULER, GAUSS regards the coordinates of the points of the surface as functions of two
parameters $p$ and $q$. He restricts himself to regular points of the surface, at which a normal
can be defined uniquely. The normal is given a direction. To each piece of surface he assigns a
« spherical image » by letting each point of the piece correspond to that point of the unit
sphere which is cut out on it by the radius parallel to, and pointing in the same direction as,
the surface normal. This spherical image is evidently taken from the astronomers' habit of fixing
directions by points of the celestial sphere. To each piece of surface GAUSS ascribes as its
« total curvature » (curvatura integra) the oriented area, i.e. the area taken with sign, of the
spherical image. If one forms the ratio of the total curvature of a piece of surface to the area
of the piece and then lets the piece contract to a point $P$, the limit toward which this ratio
tends is called the measure of curvature $k$ of the surface at the point $P$. From this
definition it follows at once that the total curvature can be written $\iint k\,d\sigma$, the
integration being extended over the whole piece of surface (surface element $d\sigma$). After
deriving the representation $k = 1/(R_1 R_2)$ ($R_1$, $R_2$ the principal radii of curvature),
GAUSS represents $k$ in the form $k = (D D'' - D'^2)/(EG - F^2)$. Here $D$, $D'$, $D''$ are
nothing other than the coefficients of the second fundamental form, today usually denoted
$L$, $M$, $N$, and $E$, $F$, $G$ are the coefficients of the first fundamental form, i.e. of the
square of the line element $ds^2 = E\,dp^2 + 2F\,dp\,dq + G\,dq^2$. This is shown only
afterwards. From this representation of $k$ GAUSS concludes that $k$ can be expressed solely by
$E$, $F$, $G$ and their partial derivatives of first and second order with respect to $p$ and
$q$. Since $ds$ is invariant under bending, he has thereby obtained the theorem he called the
« theorema egregium »: if a curved surface can be developed onto another, the measure of
curvature at each point remains unchanged. The analogous statement for the total curvature
follows from this.
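
As a worked check of the two representations of the measure of curvature, consider a sphere of radius $R$. The parametrization and the choice of the outward normal are assumptions made here for illustration, written with the modern symbols $L$, $M$, $N$ mentioned in the text.

```latex
\begin{aligned}
&x = R\sin p\cos q,\quad y = R\sin p\sin q,\quad z = R\cos p, \\
&E = R^2,\qquad F = 0,\qquad G = R^2\sin^2 p, \\
&L = -R,\qquad M = 0,\qquad N = -R\sin^2 p, \\
&k = \frac{LN - M^2}{EG - F^2}
   = \frac{R^2\sin^2 p}{R^4\sin^2 p}
   = \frac{1}{R^2} = \frac{1}{R_1 R_2}
   \qquad (R_1 = R_2 = R), \\
&\iint k\,d\sigma = \frac{1}{R^2}\cdot 4\pi R^2 = 4\pi .
\end{aligned}
```

The total curvature $4\pi$ is the area of the unit sphere, in agreement with the definition by the spherical image. Reversing the normal changes the signs of $L$ and $N$ but leaves $k$ unchanged.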


GAUSS then turns to investigations of geodesic lines. Their differential equation is set up and
the following theorem derived: given on a surface a field of geodesic lines (including the case
that all these geodesics pass through one point), any two orthogonal trajectories of this field
cut off equal arcs on all geodesic lines of the field. GAUSS next considers on the surface a
triangle whose sides are geodesic lines. Let the angles at the vertices be $A$, $B$, $C$. He then
shows that the total curvature of this triangle is equal to $A + B + C - \pi$. This is a
far-reaching generalization of a well-known and very elementary theorem of spherical
trigonometry. LEGENDRE had asked what relations exist between the angles $A$, $B$, $C$ of a
spherical triangle and the angles $A^*$, $B^*$, $C^*$ of the plane triangle with the same side
lengths. He had found that for small spherical triangles, to a good approximation,
$A - A^* = B - B^* = C - C^* = \tfrac{1}{3}(A + B + C - \pi)$. GAUSS can now generalize this
theorem to small triangles formed by geodesic lines on arbitrary surfaces. The differences
$A - A^*$, $B - B^*$, $C - C^*$ are then not equal but depend on the measure of curvature of the
surface at the vertices of the triangle. GAUSS remarks, however, that for surfaces deviating
little from the sphere the difference between the values he gives for these differences and
Legendre's value is almost imperceptible. He supports this with the example of the largest
triangle he measured, with vertices Hohehagen, Brocken and Inselsberg (side lengths about 69 km,
85 km, 107 km).
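
Legendre's relation can be checked numerically for a triangle of this size. The following Python sketch treats the Earth as a sphere; the radius of 6371 km is an assumption made here (GAUSS worked on the ellipsoid), and the side lengths are the rounded values quoted above, so the numbers are only indicative.

```python
import math

R_EARTH_KM = 6371.0              # assumed mean Earth radius, not from the source
SIDES_KM = (69.0, 85.0, 107.0)   # rounded side lengths quoted in the text
ARCSEC_PER_RAD = math.degrees(1.0) * 3600.0


def plane_angles(a, b, c):
    """Angles of the plane triangle with sides a, b, c (law of cosines)."""
    A = math.acos((b * b + c * c - a * a) / (2 * b * c))
    B = math.acos((a * a + c * c - b * b) / (2 * a * c))
    C = math.acos((a * a + b * b - c * c) / (2 * a * b))
    return A, B, C


def spherical_angles(a, b, c, radius):
    """Angles of the spherical triangle with the same side lengths."""
    a, b, c = a / radius, b / radius, c / radius   # sides as central angles

    def angle(opposite, s1, s2):
        # spherical law of cosines
        return math.acos((math.cos(opposite) - math.cos(s1) * math.cos(s2))
                         / (math.sin(s1) * math.sin(s2)))

    return angle(a, b, c), angle(b, a, c), angle(c, a, b)


plane = plane_angles(*SIDES_KM)
sphere = spherical_angles(*SIDES_KM, R_EARTH_KM)
excess_arcsec = (sum(sphere) - math.pi) * ARCSEC_PER_RAD
diffs_arcsec = [(s - p) * ARCSEC_PER_RAD for s, p in zip(sphere, plane)]
print(excess_arcsec, diffs_arcsec)
```

On this spherical model the excess $A + B + C - \pi$ comes out at roughly 15 seconds of arc, and each of the three differences agrees with one third of it to well below a hundredth of a second.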


We must deny ourselves a discussion of GAUSS's later investigations in differential geometry,
which survive only in his Nachlass. We likewise pass over the topological investigations that
arose from physical questions. Only one thing shall be pointed out: GAUSS included spaces of
arbitrary dimension in his considerations. In this he stands alone among all mathematicians of
his time.


* * *


The real center of GAUSS's extensive occupation with physical questions is his work in
theoretical physics. We begin with a brief discussion of the principle of least constraint, which
he discovered, and then pass on to the important investigations in potential theory.


LAGRANGE, in his Mécanique analytique, had founded statics on the principle of virtual
displacements and dynamics on d'Alembert's principle. In his memoir « Über ein neues allgemeines
Grundgesetz der Mechanik » (Werke V) of 1829, GAUSS now states the so-called « principle of
least constraint », which embraces statics and dynamics at once. He justifies it by showing that,
under the assumptions valid for d'Alembert's principle, the latter follows from his new
principle. This does not, of course, mean that the two principles are equivalent. We know today
that Gauss's principle reaches much further and is probably the most comprehensive of all
mechanical principles. With that, however, the question of provability falls away: it is simply
the fundamental axiom of mechanics and as such could be deduced only from equivalent axioms. The
justification given by GAUSS can only set out how one arrives consistently at the new principle.
Let there be given $n$ mass points $(x_i, y_i, z_i)$ with masses $m_i$. Let the forces
$(X_i, Y_i, Z_i)$ act on them. If, as usual, dots denote derivatives with respect to time, GAUSS
calls constraint the expression

$$\mathfrak{Z} = \sum_i \frac{1}{m_i}\Bigl\{ (X_i - m_i\ddot{x}_i)^2 + (Y_i - m_i\ddot{y}_i)^2 + (Z_i - m_i\ddot{z}_i)^2 \Bigr\}.$$

GAUSS now starts from a given state of the system of points, i.e. from given $(x_i, y_i, z_i)$
and $(\dot{x}_i, \dot{y}_i, \dot{z}_i)$, and requires that the $(\ddot{x}_i, \ddot{y}_i,
\ddot{z}_i)$ be chosen so that $\mathfrak{Z}$ becomes a minimum, subject to the side conditions
(including their derivatives with respect to time). The side conditions may be equations and
inequalities. It is not required, as in d'Alembert's principle, that they be linear in the
differentials of the coordinates. Gauss's principle is therefore, unlike the customary
principles, not a variational principle but a minimum principle. Formally it falls under the
formulas of the calculus of adjustment. GAUSS remarks that it is noteworthy that nature modifies
the free motions of a mechanical system under the pressure of conditions in just the way in which
the calculating mathematician adjusts observations of mutually dependent quantities. For the
latter proceeds according to a « principle of justice ».
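
The formal kinship with the calculus of adjustment can be made concrete in the simplest case. The Python sketch below takes one particle that is forced to move along a fixed straight line and chooses its acceleration so that the constraint is a minimum, which is an ordinary least-squares projection. The example (a mass on a 30° incline under gravity) and all names in it are assumptions chosen for illustration.

```python
import math


def constrained_acceleration(force, mass, direction):
    """Minimise Z = m * |F/m - t*d|^2 over t for a particle confined to a line.

    Returns the scalar t along the unit direction d and the acceleration vector.
    """
    free = [f / mass for f in force]                  # unconstrained acceleration F/m
    norm = math.sqrt(sum(d * d for d in direction))
    unit = [d / norm for d in direction]
    t = sum(a * u for a, u in zip(free, unit))        # orthogonal projection
    return t, [t * u for u in unit]


def constraint(force, mass, accel):
    """Gauss's constraint for a single particle."""
    return mass * sum((f / mass - a) ** 2 for f, a in zip(force, accel))


g = 9.81                                   # assumed gravitational acceleration, m/s^2
theta = math.radians(30.0)                 # assumed inclination of the line
m = 2.0                                    # assumed mass, kg
weight = (0.0, -m * g)
down_slope = (math.cos(theta), -math.sin(theta))
t, acc = constrained_acceleration(weight, m, down_slope)
print(t, constraint(weight, m, acc))
```

The minimising acceleration is the familiar $g \sin\theta$ along the incline; any other acceleration along the line gives a larger value of `constraint`.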

In 1830 GAUSS published his « Principia generalia theoriae fluidorum in statu aequilibrii »
(Werke V). This work deals with the theory of capillary forces. It is based on the assumption
that the molecular forces are central forces of the form $m_1 m_2 f(r_{12})$, where $m_1$ and
$m_2$ are the masses of the particles and $r_{12}$ is their distance. Here $f(r)$ is to decrease
toward zero extraordinarily rapidly with increasing $r$. Starting from finitely many particles
and then passing to the limit of the continuum, the principle of virtual displacements yields the
statement that in equilibrium the potential of the acting forces must have a stationary value. If
the forces admitted are gravity, the molecular attraction of the fluid particles on one another
and the attraction of the vessel wall on the fluid particles, the potential to be determined
splits into several parts that can be determined individually. If one further makes use of the
fact that the molecular forces are zero at « finite » distances, one obtains for the potential
$X$ (the notation used here differs from that of GAUSS) the expression


$$X = \varrho g \int z\,dv + \alpha F + (\alpha - 2\beta)\,G + \gamma V$$


Here $\varrho$ denotes the density of the fluid, $g$ the acceleration due to gravity, $\alpha$,
$\beta$, $\gamma$ constants to be determined experimentally, $F$ the free surface, $G$ the area
of the wall wetted by the fluid, $V$ the volume of the fluid, $dv$ the volume element and $z$ the
vertical coordinate. The further developments of this memoir belong to the calculus of variations
for double integrals. For them we refer to the detailed account by BOLZA (Werke X, 2).


To potential theory there already belongs, at bottom, the paper on the attraction of an elliptic
ring, and then the memoir « Theoria attractionis corporum sphaeroidicorum ellipticorum
homogeneorum methodo nova tractata » (Werke V) of 1813. In neither paper does the concept of the
potential yet appear. The attraction of homogeneous ellipsoids was an old problem, but one solved
only by LAPLACE and IVORY. Both use infinite series in the carefree manner of the time. GAUSS
takes entirely different paths here. He begins with six theorems on integrals over an arbitrary
closed surface $S$. Most of them are at bottom transformations of volume integrals over the
interior of $S$ into surface integrals, a procedure quite familiar to us today. Particularly
noteworthy, because of its later use, is the integral that gives the « apparent size of $S$ as
seen from a point $M$ » (the « solid angle » under which $S$ appears from $M$). Depending on
whether $M$ lies outside, on or inside $S$, it is equal to $0$, $2\pi$ or $4\pi$. It is at the
same time the simplest double-layer potential. The solution of the ellipsoid problem then follows
from the six integral theorems.


The potential is introduced by GAUSS only in the fundamental memoir « Allgemeine Lehrsätze in
Beziehung auf die im umgekehrten Verhältnisse des Quadrats der Entfernung wirkenden Anziehungs-
und Abstoßungskräfte » (Werke V) of 1839. GAUSS had already communicated theorems from this
memoir earlier, in his papers on magnetism, where he makes important applications of them.
GAUSS's memoir is preceded in time by the work of G. GREEN, « An Essay on the Application of
Mathematical Analysis to the Theories of Electricity and Magnetism », of 1828. The results of
that work overlap with those of GAUSS's. It is established that GAUSS did not know GREEN's work
and that the spread of the concept of the potential is due to him alone. The name « potential »,
which is common to GAUSS and GREEN, already occurs in the 18th century and stems from
philosophical terminology.


GAUSS begins by deriving the properties of potentials of spatial and surface distributions of
mass (continuity and discontinuity behavior of the potentials and of their partial derivatives of
first and second order, satisfaction of Laplace's differential equation in mass-free space and of
the Laplace-Poisson equation inside the attracting masses), which according to him are « not new
in their essential content ». If $V$ is the potential of masses $M_i$ concentrated at the points
$P_i$, and likewise $v$ the potential of masses $m_k$ at the points $p_k$, then
$\sum_i M_i\,v(P_i) = \sum_k m_k\,V(p_k)$. By passage to the limit GAUSS deduces from this the
corresponding theorem for two potentials of arbitrary (spatial, surface or point) distributions
of mass. An important special case is the theorem: given the potential $V$ of arbitrary masses
and a sphere of radius $R$, the mean value of $V$ on the sphere is
$\frac{1}{4\pi R^2}\int V\,dS = V_0 + M_0/R$, where $V_0$ is the value at the center of the
sphere of the potential of the masses lying outside the sphere, and $M_0$ is that part of the
masses generating $V$ which lies inside the sphere. Investigations of integrals over a closed
surface $S$ (inner normal $n$) follow, such as $\int (\partial V/\partial n)\,dS$ and
$\int V\,(\partial V/\partial n)\,dS$ ($V$ any potential). To these are added theorems on
potentials that are constant in pieces of space or on closed surfaces. In particular, if a
potential $V$ is constant on a closed surface $S$ and all the masses to which $V$ belongs lie in
the interior (exterior) of $S$, then it is also constant outside (inside) $S$. GAUSS further
considers a closed surface $S$ and covers it with mass of everywhere positive density $m$ and
total mass $M$. Let $V$ be the potential of this layer and $U$ a continuous function on $S$. If
one forms the integral $\int (V - 2U)\,m\,dS$ for all admissible choices of the layer $m$, the
set of these integrals is bounded below. From this GAUSS concludes (and this is at bottom the
same contestable inference that is used in Dirichlet's principle) that there is a particular mass
layer $m$ which makes the integral a minimum. For this layer $V - U = \text{const}$ wherever
$m > 0$. If in particular $U = 0$ on all of $S$, one can show that the corresponding minimal
layer is positive on all of $S$. The corresponding potential is constant on $S$. From these two
distinguished layers a third can now be constructed such that its potential $V^*$ satisfies
$V^* - U = 0$ on all of $S$. This means: if $U$ is prescribed on $S$, there is always a potential
that takes these values on $S$. If in particular $U$ are the values which the potential of any
masses lying entirely inside $S$ (outside $S$) takes on $S$, then there is a surface layer on $S$
whose potential outside $S$ (inside $S$) agrees with the potential of the given masses. GAUSS's
Nachlass shows that he carried these investigations further and also occupied himself with the
logarithmic potential. Its connections with function theory and conformal mapping were known to
him.
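
The mean value theorem for the sphere can be checked numerically in the simplest case of a single point mass. In the Python sketch below the unit system (potential of a point mass written as mass divided by distance), the midpoint quadrature and the function name are assumptions made for illustration. For a mass inside the sphere the mean should equal the mass divided by $R$; for a mass outside it should equal the potential of that mass at the center.

```python
def mean_potential_on_sphere(d, radius, mass=1.0, n=20000):
    """Mean of mass/r over a sphere of the given radius centred at the origin,
    for a point mass at distance d from the centre.

    By axial symmetry the surface mean reduces to (1/2) * integral over
    u = cos(polar angle) from -1 to 1, evaluated here by the midpoint rule.
    """
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        u = -1.0 + (i + 0.5) * h
        total += mass / (radius * radius + d * d - 2.0 * radius * d * u) ** 0.5
    return 0.5 * h * total


R = 1.0
inside = mean_potential_on_sphere(d=0.5, radius=R)    # mass inside the sphere
outside = mean_potential_on_sphere(d=3.0, radius=R)   # mass outside the sphere
print(inside, outside)
```

For an arbitrary distribution the two cases add up, which gives the sum of a center value for the outer masses and the inner mass divided by $R$.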


When in 1829 the Belgian physicist QUETELET was making magnetic observations in the garden of the
Göttingen observatory, GAUSS pointed out to him that the period of oscillation of a magnet can be
determined only very inaccurately from the times of greatest elongation. It was preferable, he
said, to use observations near the position of rest. This principle, that to measure times
accurately one should observe a moving mark where its velocity is greatest, whereas to fix its
position accurately one should measure at the times of least velocity, is familiar to everyone
today. It was brought into general acceptance by GAUSS's « Anleitung zur Bestimmung der
Schwingungsdauer einer Magnetnadel » (Werke V) of 1837. This work, which deals with weakly damped
oscillations, is probably one of the first that tries to put theory to use for the appropriate
design of the observing procedure. The rules for determining the equilibrium position and for
determining the damping by means of the logarithmic decrement stem from it. To these rules are
added the corrections to be applied: finding the period of the undamped oscillation from that of
the damped one, the change of the equilibrium position caused by this reduction, the passage to
infinitely small oscillations, the influence of the torsion of the suspension thread, and so on.
The only thing not taken into account by GAUSS is the reduction to a vacuum.
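
Two of the reductions named above can be sketched in Python for an ideal linearly damped oscillator. This is the standard textbook relation, not a reproduction of GAUSS's own formulas; the elongation values and function names are hypothetical, and the natural logarithm is used for the decrement.

```python
import math


def log_decrement(elongations):
    """Mean natural-log decrement per full period from successive
    elongations on the same side of the equilibrium position."""
    logs = [math.log(a / b) for a, b in zip(elongations, elongations[1:])]
    return sum(logs) / len(logs)


def undamped_period(damped_period, decrement):
    """Period of the undamped oscillation from the damped period T and the
    decrement L per period: T0 = T / sqrt(1 + (L / (2*pi))**2)."""
    return damped_period / math.sqrt(1.0 + (decrement / (2.0 * math.pi)) ** 2)


# Hypothetical synthetic readings: damping rate 0.05 per second, period 10 s.
delta, T = 0.05, 10.0
elongations = [100.0 * math.exp(-delta * T * k) for k in range(5)]
decrement = log_decrement(elongations)
T0 = undamped_period(T, decrement)
print(decrement, T0)
```

The relation follows from $\omega_0^2 = \omega_d^2 + \delta^2$ with $\omega_d = 2\pi/T$ and decrement $\delta T$; for weak damping the correction to the period is of second order in the decrement.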


GAUSS's interest in questions of magnetism and electricity fell in a period of stormy development
in these disciplines: 1820 discovery of electromagnetism by OERSTED, 1820/21 the Biot-Savart law,
1822/24 the fundamental papers of AMPÈRE, 1827 Ohm's law, 1831 discovery of electromagnetic
induction by FARADAY. As early as 1831 the magnetic declination was observed regularly in
Göttingen, and its disturbance by an aurora was noticed. Decisive, however, was the interest that
ALEXANDER V. HUMBOLDT, as geographer and geophysicist, took in the phenomena of terrestrial
magnetism. He succeeded in winning GAUSS's collaboration for this research. At first the task was
to collect a sufficiently large body of observations, to develop the apparatus and the methods of
observation, and to set up and organize the observations appropriately. Leadership soon lay
entirely with GAUSS and his collaborator, the physicist W. WEBER, who had been called to
Göttingen in 1831. To inform their collaborators and to disseminate the results of their
research, GAUSS and WEBER published at their own expense a journal, « Resultate aus den
Beobachtungen des magnetischen Vereins », which ran to six volumes (1836-1841). The « Magnetische
Verein » rested on entirely voluntary cooperation; it was thus one of the earliest examples of
what is today called « teamwork ». Observations were gathered not only from Europe but from the
whole Earth, from the North Cape to the Cape of Good Hope, from North and South America, from
Siberia, from South and East Asia, and from Australia as far as Tahiti. At first declination,
horizontal intensity and inclination were observed. Observations of the inclination soon receded,
because it could not be measured with the same accuracy as declination and horizontal intensity,
and because GAUSS's potential-theoretic investigations showed that the measurement of the
inclination could be dispensed with, admittedly under the assumption that terrestrial magnetism
has its seat entirely in the interior of the Earth. Let it be said in advance that GAUSS also
solved numerically the boundary value problem thus posed. In the « Atlas des Erdmagnetismus »
(Werke XII) of 1840 he presented the results of his calculations in maps from various points of
view, and tabulated and compared the computed and observed values of declination, inclination and
horizontal intensity. The solution of the boundary value problem rests here on an expansion of
the potential in a series of general (Laplace) spherical harmonics, of which GAUSS retains the
terms (which, incidentally, can be interpreted physically) up to the fourth order. The accuracy
of the observations did not permit going further. For the terms retained, the shortcomings in the
accuracy of the observations are compensated by their comparatively large number. This
calculation showed that the Earth, as a magnet, possesses exactly two poles. It was a triumph of
calculation that these poles were actually found by explorers not too far from the computed
positions.
The intuitive picture with which GAUSS works in his considerations on magnetism is that of two
magnetic fluids. With this picture it must still be explained what a magnetic pole is. For this
purpose GAUSS replaces the spatially distributed magnetic fluids by a surface layer, equivalent
in the exterior space (i.e. yielding the same potential), on a closed surface enclosing all the
magnetic fluids. In the « Intensitas vis magneticae terrestris ad mensuram absolutam revocata »
(Werke V) of 1832 GAUSS treats the conventions that are needed to make magnetic measurements
carried out at different places or at different times comparable. In doing so he adopts the
French metric system, probably because at that time it was the only one that was valid over an
extended region and sufficiently secured by primary standards. He determines the quantity of
magnetism by its force, taking as a basis the fact, established by COULOMB and checked again by
himself, that Newton's law of attraction holds formally for magnetic attraction and repulsion.
The magnetic unit then exerts, at a distance of 1 mm, the force 1 mg·mm·s⁻² on the mass 1 mg.
The factor still possible in the law of attraction, corresponding to the gravitational constant,
is normalized to 1. The way in which declination and horizontal intensity are determined
according to GAUSS's prescriptions is common knowledge today and hardly needs to be described.
Attention may be drawn to a few circumstances. Since GAUSS aims at precision measurements, he
uses the reading of a scale with mirror and telescope, which he had invented independently of
POGGENDORFF. Furthermore, in observations of oscillations he always applies the corrections
mentioned earlier. Because magnetism depends on temperature, he takes care to keep the
temperature as constant as possible. The observations of the declination at different observing
stations showed that the considerable fluctuations of the declination over short times were not
disturbances of a local nature. They occurred at all observing stations with nearly the same
magnitude and at the same time.


GAUSS conjectured that the same must hold for the horizontal intensity. With the usual method of
measuring it this could not be established, because the measurement takes comparatively long and
can therefore yield only a mean value. In the bifilar magnetometer GAUSS created for himself an
instrument for measuring rapid fluctuations of the horizontal intensity. This instrument can also
be used for other measurements. Despite his recognition that the inclination cannot be measured
with the same sharpness as the other two elements of terrestrial magnetism, and despite its
lesser importance, GAUSS also made efforts to determine it. He finds the earth inductor to be the
most suitable instrument.
An essential aid to GAUSS in his investigations was the construction of an iron-free magnetic
observatory in 1833. In his work « Allgemeine Theorie des Erdmagnetismus » (Werke V) of 1838
GAUSS collects the potential-theoretic and other theorems that we have already mentioned in the
course of our account, so that we need not go into them further. Some details are nevertheless
remarkable. Thus he uses, for example, the theorem that the work along a closed path in a
potential field is zero for the following calculation, in which the integrals that occur are
approximated numerically. He considers the Earth's magnetic field and computes the horizontal
intensity in Paris from the declination in Paris and the declinations and horizontal intensities
in Göttingen and Milan. The path of integration consists of the sides of the triangle
Göttingen-Milan-Paris. The agreement of the computed value with the observed one is surprisingly
good.


Soon after the beginning of his investigations on magnetism GAUSS made his first experiments in
the theory of electricity. Here too he did not publish his most important results. Everything
relevant is contained in Werke V and Werke XI. GAUSS began by pushing a coil over the magnet of
the magnetometer, thereby turning it into a galvanometer or, in his terminology, a
« multiplier ». The first extensive experiments were devoted to confirming Ohm's law and, in
connection with it, to demonstrating the identity, still in question at the time, of galvanic
electricity, thermoelectricity and frictional electricity. By inserting several multipliers into
his very long lines GAUSS also checks whether the current strength is the same at all points of
the line or whether it shows a drop. In connection with the constancy of the current strength,
these experiments showed that the speed of propagation of electricity in conductors could not be
measured with the means at GAUSS's disposal. This was in accord with WHEATSTONE's finding that
this speed is of the order of magnitude of the speed of light. In the course of his
investigations GAUSS also found the laws, later rediscovered by KIRCHHOFF, governing electric
currents in networks of conductors, and the derivation of these laws from a single minimum
principle. The physical interpretation of this principle, namely that for the actual
distribution of current the Joule heat becomes a minimum, remained unknown to him. We cannot go
into remarks by GAUSS in which he set down correct insights with divinatory certainty but from
which no papers arose, for example on the Voltaic series and contact electricity. Here he
invokes molecular forces for the explanation.
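
The minimum principle can be illustrated for the simplest network, a set of parallel branches carrying a prescribed total current. The Python sketch below is a modern illustration under assumed resistance values, not a reconstruction of GAUSS's notes. Minimising $\sum_k I_k^2 R_k$ under the condition $\sum_k I_k = I$ gives currents proportional to the conductances $1/R_k$, so that all branches show the same voltage drop, as Kirchhoff's loop rule requires.

```python
def split_current(total, resistances):
    """Branch currents that minimise the Joule heat sum(I_k^2 * R_k)
    subject to sum(I_k) = total (closed form from a Lagrange multiplier)."""
    conductances = [1.0 / r for r in resistances]
    g_sum = sum(conductances)
    return [total * g / g_sum for g in conductances]


def joule_heat(currents, resistances):
    """Heat produced per unit time in all branches."""
    return sum(i * i * r for i, r in zip(currents, resistances))


resistances = [1.0, 2.0]          # assumed branch resistances, ohms
currents = split_current(3.0, resistances)
drops = [i * r for i, r in zip(currents, resistances)]
print(currents, drops, joule_heat(currents, resistances))
```

Any other split of the same total current, for example 2.1 A and 0.9 A, produces more heat than the split returned by `split_current`.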
To carry out the experiments on Ohm's law, W. WEBER had connected the observatory with the
Physical Institute by an outgoing and a return line. As early as 1833 GAUSS hit on the idea of
using this line for a telegraphic connection by means of galvanic electricity. This proved
impractical because the signalling was too slow. Only when GAUSS in 1835 hit on the idea of
telegraphing with induction pulses, which he obtained by sliding a coil back and forth on a
magnet (he calls such a device an « inductor »), did the transmission of messages become
sufficiently fast. It was improved further when GAUSS made the oscillations of the galvanometer
on the receiving side die away quickly by eddy-current damping. GAUSS did not content himself
with an improvised apparatus but combined all the parts into a purposefully designed instrument.
Little as GAUSS valued his invention from the scientific point of view, he was very much aware of
its practical importance. He entered into negotiations to make it useful for the train safety
service of the Leipzig-Dresden railway, which was then under construction. Nothing further is
known, since the files were destroyed.


The greatest achievement of GAUSS in the theory of electricity is his
investigation of the phenomena of induction. The first observation is that the
oscillations of a galvanometer come to rest more quickly (have an increased
logarithmic decrement) when the circuit containing the galvanometer coil is
closed. With several galvanometers connected in series, GAUSS observes that
the oscillations of one galvanometer produce in the others forced oscillations
with a phase shift, which are superimposed on their natural oscillations. The
correspondence with OLBERS shows that in 1835/36 GAUSS and WEBER built a
magneto-electric machine using permanent magnets. They used it for a variety
of experiments, e.g. for the decomposition of water. We are not sufficiently
informed about the details. This machine appears to be a further development
of the inductors that GAUSS used so often. GAUSS's theoretical developments on
induction follow on from AMPÈRE. The latter had replaced the magnetic action
of a closed current by that of a magnetic double layer spanning the circuit.
GAUSS makes use of this to investigate the actions of two closed circuits on
each other when the current strength in one of them fluctuates. The potential
of this double layer at a point P is nothing other than the current strength J
times the apparent size or, as physicists say, the solid angle under which the
double layer appears from P. If a positive magnetic unit pole is placed at P
and P is made to travel along a closed curve, it follows from the properties
of the apparent size mentioned earlier that at each passage through the double
layer the potential jumps by ±4πJ, i.e. that the work done in this journey is
A = 4πmJ, where m is the number of times the curve traversed by P winds around
the circuit. GAUSS is thus led to the topological problem of determining the
number of linkings of two closed curves. Rewritten in modern form, the formula
for A contains the fundamental law of electromagnetism (m = 1). In this
connection the vector potential of a linear current also appears in GAUSS's
work.
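
The linking number m in A = 4πmJ can be illustrated numerically. The sketch
below is not taken from GAUSS's text: it assumes the double-integral form of
the linking number that is usually associated with his name,
m = (1/4π) ∮∮ (r1 - r2) · (dr1 × dr2) / |r1 - r2|³, and evaluates it with a
simple midpoint rule on closed polygons. The helper names `circle` and
`linking_number` and the example value of J are hypothetical.

```python
import math

def circle(center, u, v, n=200):
    """Closed polygon approximating the unit circle center + cos(t)*u + sin(t)*v."""
    pts = []
    for k in range(n):
        t = 2.0 * math.pi * k / n
        pts.append(tuple(c + math.cos(t) * a + math.sin(t) * b
                         for c, a, b in zip(center, u, v)))
    return pts

def segments(curve):
    """Return (midpoint, edge vector) for every edge of a closed polygon."""
    n = len(curve)
    out = []
    for i in range(n):
        p, q = curve[i], curve[(i + 1) % n]
        mid = tuple(0.5 * (p[k] + q[k]) for k in range(3))
        edge = tuple(q[k] - p[k] for k in range(3))
        out.append((mid, edge))
    return out

def linking_number(curve_a, curve_b):
    """Midpoint-rule estimate of the double line integral for the linking number."""
    total = 0.0
    segs_b = segments(curve_b)
    for ma, da in segments(curve_a):
        for mb, db in segs_b:
            r = (ma[0] - mb[0], ma[1] - mb[1], ma[2] - mb[2])
            # cross product da x db
            cx = (da[1] * db[2] - da[2] * db[1],
                  da[2] * db[0] - da[0] * db[2],
                  da[0] * db[1] - da[1] * db[0])
            dist = math.sqrt(r[0] ** 2 + r[1] ** 2 + r[2] ** 2)
            total += (r[0] * cx[0] + r[1] * cx[1] + r[2] * cx[2]) / dist ** 3
    return total / (4.0 * math.pi)

# Circuit in the xy-plane; one path that threads it and one that does not.
ring = circle((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
linked_path = circle((1.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 0.0, 1.0))
unlinked_path = circle((3.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 0.0, 1.0))

J = 2.0  # illustrative current strength, units as in the text
m = round(linking_number(ring, linked_path))  # sign depends on orientation
work = 4.0 * math.pi * abs(m) * J             # magnitude of A = 4*pi*m*J
```

The sign of the result depends on the orientation chosen for the two curves;
only its magnitude enters the comparison with the text. For the path that
does not thread the circuit the estimate is close to zero, so no net work is
done along it.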


GAUSS's investigations of the elementary laws of force between current
elements are of only historical significance for us. Probably the most
important result obtained by GAUSS is the following: an element of a current
experiences a force from the magnetic field of another current. This force
possesses a potential. GAUSS obtains it by using the Biot-Savart law to
calculate the virtual work done in a displacement of the current element. By
setting up this « electrodynamic potential » and drawing physical conclusions
from it, GAUSS anticipated one of the most significant achievements of
FRANZ NEUMANN by 12 years.

The collaboration between GAUSS and W. WEBER, who had been appointed to
Göttingen at his instigation, was so close that in many cases the share of
each cannot be separated exactly. Until 1840, however, GAUSS was without doubt
the leading mind in this partnership. The collaboration was abruptly
interrupted by the coup d'état of King ERNST AUGUST of Hanover who, on his
accession to the throne in 1837, abolished the constitution in force and
replaced it with its reactionary predecessor. Since WEBER was one of the seven
protesting Göttingen professors, he was dismissed from his post. He did remain
in Göttingen until his appointment to Leipzig, but with the separation that
followed, the physical work of GAUSS came to an end. WEBER's return to
Göttingen in 1849 led to a renewal of the collaboration, but with less effect.


The important investigations of GAUSS in geometrical optics are connected
with his astronomical activity. Starting from EULER's dioptrics, in 1810 he
calculates a telescope objective for REPSOLD which, after a first failure by
the grinder, turns out excellently. As in this case, it was always external
occasions that prompted him to take up questions of optics. So it was with the
dialytic telescopes. These were constructed because at that time the
production of sufficiently large, flawless pieces of flint glass was a
technically difficult problem. Another occasion was the theoretically
deficient determination of the focal length of the Königsberg heliometer
objective by BESSEL. GAUSS began to consider the effects of the wave nature of
light on imaging by optical instruments. Under the pressure of other work he
broke off these investigations and never returned to them. For this reason his
developments, which remain entirely within geometrical optics and are
restricted to paraxial bundles, are no longer satisfactory. Nevertheless they
played an extremely important part in the historical development.

In accordance with the custom of the time, GAUSS at first calculates
objectives in such a way that the curvature of field for the marginal rays and
the chromatic error for the paraxial rays are eliminated. For every crown
glass lens of given focal length one can specify a flint glass lens such that
these requirements are fulfilled. The ratio of the radii of curvature of the
crown glass lens still remains arbitrary. The erroneous result of another
investigator, that this fact could not be exploited for a further improvement
of objectives, prompted GAUSS to check it. In the paper « Über die
achromatischen Doppelobjektive, besonders in Rücksicht der vollkommenern
Aufhebung der Farbenzerstreuung » (Werke V) of 1817 he shows that, with a
suitable choice of the radii of curvature of the crown glass lens, the
chromatic error can be eliminated for the marginal rays as well. The
successful manufacture of such an objective was first achieved in 1860 by
STEINHEIL, who explains that the earlier failures were due partly to technical
difficulties and partly to an insufficient understanding of GAUSS's
developments. GAUSS also considers whether it is expedient to carry out the
possible corrections precisely for the central and the marginal rays; he at
first affirms this, but later gives a correction.

From 1840 to 1841 GAUSS occupies himself with the dialytic telescopes. In
these the flint glass lens was separated from the crown glass lens and moved
to a position where the bundle of light rays is very narrow. The separation
of the crown and flint glass lenses then makes it necessary to supplement the
latter by a crown glass lens. GAUSS notices at once that such an objective
cannot be achromatic. He soon finds, however, that this is quite immaterial:
what matters is that the whole optical system, consisting of objective and
eyepiece (and possibly the eye as well), is achromatic.


The most important optical work of GAUSS is the « Dioptrische
Untersuchungen » (Werke V) of 1840. Let it be noted in advance that the
introduction of the principal planes leads to the recognition that object
space and image space are collinearly related to each other. It is very
remarkable that GAUSS nowhere says this, despite the gain in simplicity and
clarity that follows from it. GAUSS follows the path of a light ray through a
centred system of refracting spherical surfaces, restricting himself to
paraxial rays. Tracing the ray through the system by calculation is an
elementary problem of analytic geometry. From it one can find the position of
the image P* of a point P lying on the incident ray. The ratio of the distance
of P* from the axis to the distance of P from the axis is called by GAUSS the
lateral magnification. The locus of the points P that yield image points P*
with lateral magnification 1 is a plane perpendicular to the axis; the same
holds for the points P*. These planes are the two principal planes, and their
intersections with the axis are the principal points. If an incident ray is
extended (without refraction!) to its intersection Q with the first principal
plane, the emergent ray, extended backwards, passes through Q*. It follows
that the refracting system can be replaced by a single refracting spherical
surface of known radius if the refractive indices of the first and last media
are different, and by a single infinitely thin lens of known focal length if
the refractive indices are equal. There follows the definition of the focal
points, and of the focal lengths as the distances of the focal points from the
corresponding principal points. Special treatment is given to the singular
case in which both focal lengths are infinite, as is the case for a telescope
focused at infinity.
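
The construction of the principal planes can be sketched with the ray-transfer
matrices of present-day paraxial optics. This is a modern restatement, not the
notation of GAUSS: a ray is described by its height y and reduced angle n·u,
each refracting surface and each gap by a 2×2 matrix. The function names, the
restriction to a system with the same medium in front and behind, and the
numerical lens data (a biconvex lens with index 1.5, radii ±100 and thickness
10) are assumptions made only for illustration.

```python
def matmul(m, k):
    """Product m*k of two 2x2 matrices given as nested tuples."""
    (a, b), (c, d) = m
    (e, f), (g, h) = k
    return ((a * e + b * g, a * f + b * h),
            (c * e + d * g, c * f + d * h))

def refraction(n_in, n_out, radius):
    """Paraxial refraction at a spherical surface (acts on (y, n*u))."""
    return ((1.0, 0.0), (-(n_out - n_in) / radius, 1.0))

def translation(distance, n=1.0):
    """Propagation over `distance` in a medium of refractive index n."""
    return ((1.0, distance / n), (0.0, 1.0))

def system_matrix(elements):
    """Multiply the element matrices in the order the light meets them."""
    m = ((1.0, 0.0), (0.0, 1.0))
    for element in elements:
        m = matmul(element, m)
    return m

def cardinal_points(m):
    """Focal length and principal points for a system with air on both sides.

    h1 is measured from the first vertex, h2 from the last vertex,
    both positive in the direction of the light.
    """
    (a, _b), (c, d) = m
    if c == 0.0:
        # The singular case of the text: both focal lengths are infinite.
        raise ValueError("afocal system: no finite focal length")
    focal_length = -1.0 / c
    h1 = (d - 1.0) / c
    h2 = (1.0 - a) / c
    return focal_length, h1, h2

# Illustrative thick biconvex lens in air.
lens = system_matrix([
    refraction(1.0, 1.5, 100.0),
    translation(10.0, 1.5),
    refraction(1.5, 1.0, -100.0),
])
focal_length, h1, h2 = cardinal_points(lens)

# Matrix from the first to the second principal plane.
plane_to_plane = matmul(translation(h2), matmul(lens, translation(-h1)))
```

Measured between the two principal planes, the matrix takes the form of a
single thin lens: its upper-left entry is 1 (lateral magnification 1) and its
upper-right entry is 0, so a ray leaves the second principal plane at the
height at which it met the first. This is the statement about Q and Q* in the
text.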


In our account of the scientific achievements of GAUSS, a great many important
and significant contributions of his to mathematics and physics have had to
be passed over, for example his work on actuarial mathematics. Nevertheless,
the works discussed will probably give an adequate picture of the unique
scientific greatness of the man.